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DESCRIPTION 

DIAGNOSIS OF DISEASE STATE USING MRNA PROFILES 
BACKGROUND OF THE mVENTiON 

A. Field of lhb Invention ; 

The present invention relates generally to the detection and diagnosis of human disease 

states and methods relating thereto. More particularly, the present invention concerns probes and 

■ • .■"*■. 

methods Useful in diagnosing, identifying arid monitpring the progression of disease states through 
10 measurements of gene products. 

B. Description of the Related Art 

Genetic detection of human disease states is a rapidly developing field (Taparowsky et aL, 
1982; Slamon et aL y 1989; Sidransky etal y 1992; Miki et al y 1994; Dbng et al y 1995; Morahane/ 

15 aL 9 1996; Lifton, 1996; Barinaga, 199(5). One advantage presented by this field is that certain 
disease states ma^ be detected by non-invasive means, e.g. sampling peripheral blood or amniotic 
fluid. Affected individuals may be diagnosed early in disease progression^ flowing more effective 
patient management Wi th better clinical outcomes. 

Some problems exist with this approach^ ;A number of known a gei>etic lesions merely 

20 predispose to development of specific disease states. Individuals carrying the genetic lesion may 
not develop the disease state, while other individuals may develop the disease state without 
possessing a p^icular genetic 1^ In hum^ cancers^ potentially occinr in a 

large number of known tumor suppresser genes and proto^n^ogoies. 

The genetic detection of cancer has a long history. One of the earliest genetic lesions 

25 shown to predispose to cancer was tramforniing point mutations in the ray oncogeiies (Taparowsky 
et al, 1982). Transforming fas point mutations, may he detected in the stopl of individuals with 
benign and malignant colorectal tumors (Sidransky et a/., 1992). However, only 50% of such 
tumoKjeontained a ras mutation (Sidransky et a/., 1992), Similar results have been obtained with 
amplification of HER-2//?ew in breast and ovarian cancer (Siairion et al, 198$), deletion and 
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mutation of p53 in bladder cancer (Sidrarisky et al y 1991), deletion of DCC in colorectal cancer 
(Fearon et al , 1 990) and mutation of BRCA J in breast and ovarian cancer (Miki et al , 1 994). 

None of these genetic lesions are capable of predicting a majority of individuals with cancer 
and most require direct sampling of a suspected tumor, making screening difficult Further, none of 

5 the markers described above are capable of distinguishing between metastatic and non-metastatic 
forms of cancer. In effective management of cancer patients, identification of those individuals , 
whose tumors have already metastasized or are likely to metastasize is critical. Because metastatic 
cancer kills 560,000 people in the US each year (ACS home page), identification of markers for 
metastatic cancer, such as metastatic prostate and breast cancer, would be an important advance. 

10 A particular problem in cancer detection and diagnosis occurs with prostate cancer. 

Carcinoma of the prostate ^CA) is the second-most frequent cause of male cancer-related death in 
the United States (Boring, 1993). The incidence of prostate cancer increased by 50% between 1980 
and 1990 (Stone et al, 1994). Although relatively few prostate tumors progress to clinical 
significance during the lifetime of the patient, those which are progressive in nature are likely to 

1 5 have metastasized by the time of detection. Survival rates for individuals with metastatic prostate 
cancer are quite low. Between these extremes are patients with prostate tumors that will 
metastasize but have not yet done so, for whom surgical prostate removal is curative. 
Determination of whibh group a patient falls within is mtic^ in determim 
patient survival. 

20 Genetic changes reported to be associated with prostate cancer include: allelic loss (Bova, et 

al, 1993; Macoska et al, 1994; Carter et ial, 1990); E>NA hypermethy lation (Isaacs et al, 1994); 
point mutati6ns br deletidiis of the retinoblastoma (Rb) and pS3 genes (BoOkstein e/ til , 1990a; * 
Bookstein et al, 1990b; Isaacs et al, 1991); and imeup chromosomes 
detebtedby fluorescence in situ hybridization (FISH) (Macoska et al , 1 994; Visakorpi al, 1994; 

25 Takahashi et al, 1994; Aleiiraze/ al, 1994> 

A recent developnlent in this field was the identification of a pipstate rhetastasis suppressor 
gene, (Dong a/., 1995); Iri^^ 

caused a sigiiific^ht decrease in metastatic tumor formation (Dong e/ ^/., 1995). However, 
detection of Thus, 
30 either a primary prostate tittnor must be sampled or el^ sufficients^ present 
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in blood, lymph nodes or other tissues to detect the trussing or abnormal gene. Further, the 
presence of a deleted gene may frequently be masked by large numbers of untransformed cells that 
may be present in a given tissue sample. 

The most commonly utilized current tests for prostate cancer aire digital rectal examination 
5 (DRE) and analysis of serum prostate specific antigen (PSA). Although PS A has been widely used 
as -^clinical marker of prostate cancer since 1 988 (Partiii & Oesterlihg, 1 994), screening programs 
utilizing PSA alone or in combination with digital rectal examination have not been successful in 
improving the survival rate for men with prostate cancer (Piirtirt & Oesterling, 1 994). While PSA is 
: specific to prostate tissue, it is produced by normal and benign as well as malignant prostatic 
1 0 epithelium, resulting in a high false-positive rate for prostate cancer detection (Partin & Oesterling, 

^T994):; - 

Other markers that have been used for pro$tMe cahcer detection include prostatic acid 
phosphatase (PAP) and prostate secreted protein (PSP). PAP is secreted by prostate cells under 
hormonal control (Brawn et al y 1 996). It has less specificity and sensitivity than does PSA. As a 
15 result, it is used much less now, although PAP may still have some applications for monitoring 
metastatic patients that have failed primary treatments. In general, PSP is a more sensitive 
biomarker than PAP, but is not as sensitive as PSA (Huang et ah , 1 993). Like PSA, PSP levels are 
frequently elevated in patients v^^B^ 

Another serum marker associated with prostate disease is prostate specific membrane 
20 antigen (PSMA) (Horoszewicz et ah , 1987; Garter etiah , 1 996; Murphy e/ ah, 1996). PSMA is a 
Type il cell membrane protein and has/been identiffeid Koli6 Acid Hydrolase ^ 
1906; Carte* et ah 9 1996). Antibodies against PSMA*e&0t with/both normal prostate tissue and 
* Estate cancer tissue (Horoszewicz et ah, 1987). Murphy et ah (1995) used EUSA to dete6t 
serum PSMA in advanced prostate cancer. As a serum test, PSMA levels are a relatively poor 
25 indicator of prostate cancer. Hq>ye^ r P^ circumstances. PSMA 

• is expressed in metastatic prostate tumor capillary beds (Silver et ah, 1997) and is reported to be 
more abundant in ^ blood of PSMA 
messenger RNA (rtiRNA) is ddWn^gulated 8^10 fojd in cancer cell line 

after exposure to 5-a-dihydroxytesto^ 1994). 
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and fih „ )bta . ( Sai«o c , „,,. 1994 , devious ^^^^^^ 

opre E u la , i()n „ r „ . s . n . te ° f (I993 > «*• Northern bio, analysis ,o-show 

"-ease 8 T myel ~ 1 — reund an 

Sch.ibc n bn 1 .e„„ <rf . (I ,9S)obs=rv2 ^" Wfi ^ **^«<0 .reagent 

Vi ./vj; observed a correlation between II -« „ 

— — — , r „. T!r !!r r ' " 

„ The ,.,.„„„, disc.osure is *e fts, ,„ c^bine^e.sn^ if * 8 „ , 
prostate tumor. aavanced from localized forms of 
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Moon of anonna, sobjec, (a l^dMd^ ' 

■dnal). An .mportan, advanage provided by fte present 
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invention is that a disease state may be detected, diagnosed, or a prognosis may be derived by 
examining a blood sample rather than relying on a more invasive, or less sensitive test In 
addition, a subject may be monitored for disease progression, siktus land response to therapies 
through monitoring of differentially expressed disease Markers In certain embodiments of the 
5 present invention at "patient" "individual" or "subjecf * may be an animal, including a laboratory 
animal or other anifnakspecies, or in certain embodiments a Human subject. 

Ih certain embodiments of the invention the terms "expression", "gene expression" and 
"expression products" may refer to either production 6f a; hiaarker geiie RNA message or the RNA 
message produced or- both. In certain bther einbbdimehts of the invention the terms 

10 "expression" "gene expression" and "expression prddticts" miay refer to either translation of a 
marker RNA message "into proteins, polypeptides arid/or peptides, arid/or to the produced 
proteins, polypeptides, and/or peptides. % certain aspects of iiie invention a marker may be a 
gene whose expression is activated to a higher level in a patient ^ufFering a disease state, relative 
to its expression in a healthy subject. It is als6 uridetstotid that a 4ififerentialiy expressed marker 

15 may be eitlier activated or inhibited at the nucleic acid level or protein level, or may it miay , 
subject to Alternative splicing to result in a different polypeptide product. Such differences may 
be evidenced by a change in irtRNA levels, surface expression, secretion or other partitioning of 
a polypeptide, for example. In certain aspects of the invention, a marker ihay be a comparison of 
expression between tWb or more marker genes, and/or ai comparison of the ratios of the 

20 expression between two or more maricer genes, or- evfen a comparison 'W twd differently 
proce^slerf products of the same gefie, Wfiich drfei^ between healthy ^bfeets and subjects 
' ~ siiffeniig ^ diseaSeii State. 7 a 

As "demonstrated in the examples included Befeirif the present inventors have identified 
certani m^i^ ^ m^ods of idehtif^g barkers that &vb bera ^ detection of 

25 v niet^tid prostate afad toetkstatic breast eaiM^er! These^ex^niples haire detoonstfatetfthat disease 

• M% detected ^d ^dMtdred by MrvfcjfagThfc re^hsfe cff healthy iiimimie cells to the 
disease condition. As such, this novel rriethc^^ of 

* ^arifefei^ tfeat are differentially expressed in respcMse'tb o&ei^^ as other 
di&al&s such as ksthm^ Inptis efythe^ 
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gravis, autoimmune thyroiditis, amyloid lateral sclerosis, interstitial cystitis, prostatitis or other 
systemic or chronic conditions. 

In a certain embodiment of the present invention, the inventors have demonstrated the 
ability to detect and discriminate between benign prostatic hyperplasia (BPH) and prostate cancer, 
5 using multivariate analysis with several different prostate disease markers. By combining test 
results for serum prostate specific antigen (PS A) and IL-8 gene products, it is possible to identify a 
. significant proportion of individuals \yith prostate cancer, while achieving close to- one hundred 
percent accuracy in differentiatingbetween individuals presenting with prostate cancer versus BPH. 
These levels of sensitivity and specificity represent significant advances over the prior art in 

10 prostate cancer detectipn and differentiation, which traditionally have been petfcnmed with 
univariate analysis with PSA, digital rectal examination and other techniques. It is further disclosed 
that levels of IL-8 gene product in the peripheral circulation may be used to discriminate advanced 
from localized stages of prostate cancer. 

It is an important aspect of the present invention that it is the response pf the normal 

15 blood lymphocytes that is being examined, rather than the prostate, breast or other disease cells 
themselves as in previous methods. As an aspect of the invention, certain mRN As are identified 
that are differentially expressed in normal cells, as a reaction to a disease state, relative to their 
expression in healthy subjects. Two of the metastatic ^ cancer-markers disclosed h^in present 
previously unreported gene?, with one of the two matching a small expressed sequence tog (EST) 

20 described in Genebank Accession # T030 1 3 and SEQ ID NO: 1 , and another iriatching Ae sequence 
disclosed in ^ SEQ IE) NO:2. Anothermaricer corresponds t<> the sequence of elongation factor 1> 
alpha (Genebank Accession # X03558 and SEQ ID NO:3). Two oth« markers rg>resent 
alternatively spliced forms (Genebank Recession # M28130 and S£Q ID NO:5; Genebank 
Accession 3 Y00787 and SEQID^O:4) of m^A from the IL-8 (int£rie*0an 8) gene. One 

25 meta^tic cancer marker is a previously ^characterized gen^; (SEQ ID ^Ol29) that, has homology 
to a nura^arpf previously identified EST sequences^ while another marker is a preyi<|asly id^tified 
g;ene sequence (KA00026^ ^ 
;The ^ 

(PS^; pro^t? v ^R(^ific ^eanbrane janfigen (PSMA=Folic Acid Hydrola^); pr^^ te acid 
30 phosphatase (PAP); prostatic secretory proteins (PSPc*); human kallekrein 2 (HK2); and the ratio 
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of the concentrations of free and bound forms of PSA (f/t PSA), in combination with ainy of the 
markers identified herein as SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:4, SEQ ID 
NQ:2 or SEQ ID NO:29, and the sequences identified in Oenebank Accession #s D8745 1 , T0301 3, 
X0355 8, M28 130, and Y00787, their complementary nucleic acid sequences, or their expression 
5 products may be used in all embodiments using of detecting the markers in any of llii^ methods 
"tJisdiosed herein or known in the art. In the examples disclosed hereiii, tfie differeiitial expression 
of marker genes is detected by RNA fingerprinting methods, however, differential expression 
detected by Shy other means, including but hot limited to other RNA fingefprihting methods, 
Northern blotting, ^ iiriniiinodetection, pfotein-J>r6tein interactions, biological activity or other 
1 0 Methods known in the art would fall within the Scope of the present invention. ; 

^ The presefit disclosure is the first report of an alternatively spliced form of IL-8 mRNA thai 
includes intf on 3. In the peripheral blood of notrrial individuals the mRNA transcript containing 
intron 3 (Genebank Accession # M28 i 30) is moire abundant than the previously reported spliced 
form from which intron 3 is missing (Genebank Accession # Y00787). Surprisingly, in patients 
15 with metastatic prostate cancer fee previously ireported spliced form is much more abundant, with a 
seven-fold increase compared to riormal individuals. In contrast, the transcript containing intron 3 
is approximately Seven-fold less abundant in patients with metastatic prostate cancer than in normal 
"-".individuals." ""' ■": -I ■ " l ."' 

The substantial change in levels of alternatively spliced mRNA species in the peripheral 
20 blood 6f individtiais with metastatic cancer provides a simple and effective diagnostic test for the 
presence of cancer metastases, that is unaffected by problems in sampling pritnary tumors or the 
taasking ihfluence i6f norinal cells ih a tissue sample* It therefore represents a si^ficant advance 
ov€f previous methods for detecting and diagnosing metastatic Cancer in humans. The skilled 
practitioner Svill realize that metastafic cancfer det^tion and diagnosis may be performed by 
25 quantitative analysis of either the IL-8 ntRNA transcripts themselves or their protein products. : 

/ The present disclosure represents a substantial unexpected advance over pitivious 

' fctfowted&e in this field. It reports a tiovel Sliced fonii df IL--S niRMA that is repressed in 
metastatic prostate cancer. It provides a sensitive meahV for detecting metastatic cancer by 
: liieasiiring tiie levels the twa aitem It provides a highly 

30 sensitive and specific inethod for detecting and differentiating between BPH, localized and 
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adyanced forms of prostate cancer by combining detection of IL-8 gene product with other markers 
of prostate disease. 

The present disclosure further demonstrates the feasibility of detecting and diagnosing 
human disease states in general by monitoring changes in the expression of specifib genes in 
5 peripheral lymphocytes. The skilled practitioner of the art will realize that such a technique has 
widespread applicability for screening of asymptomatic mdiyiduals for disease state marikeis. 

The identified disease state markers may in turn be used to design specific oligonucleotide 
probes anci primers. In certain preferred embodiments the term "primer" as used here includes any 
nucleic acid capable of priming template-dependent synthesis of a nascent nucleic acid. In certain 
10 other embodiments the nucleic acid may be able to hybridize a template, but not be extended for 
synthesis of nascent nucleic acid that is complementary to the template. As used herein a "primer" 
may be at least about 5, about 6, about 7, ^bout 8, about 9, about 10, about 1 1 , about 12, about 1 3, 
about 1 4, about 1 5, ^bout 1 6, about 17, about 1 8, about 1 9, about 20, about 21, abfcut 22, about 23, 
about 24, about 25, about 26, about 27, about 28, about 29, about 30, about 35, about 40, about 50, 
15 about, 75, about 100, about 15Q, about 200, about 300, about 400, about 500, to one base shorter in 
length than the template sequence at the 3' end of the primer to allow extension of a nascent nucleic 
acid chain, though the 5' end of the primer may extend in length beyond the 3' end of the template 
sequence. In certain embodiments of the present invention the term "template" may refer to a 
xiueleic acid that is used in the creation of a complementary nucleic ^cid strand to the "template" 
20 strand. The template may be either RNA and/pr DN A, and the complemeJi^aiy sb^^ may also be 
RN A and/or E>NA. In certain emh^iments the complementary strand may comprise v all or part of 
the complementaiy sequence to tibe template, and/or may include mutations so that ft is not an 
^, h : exQCt; complementary strand to the template. Strands that are not exactly coimplemetitaiy to the 
■ template strand may ^ hybridi2»:specifically to the template strand in detection assays described herej 
25 as well as other assays knpwn in the art^ and such complementary strands that can be used in 
/ detectipnas^y - 

A^ett atoplifi^ and 

^ P blood samples. In certain aspects of ihe iiiyentioni 

^-^ipb^s^ "aitiplificatic>n7 m^y refer to any method or technique known in the art or described herein 
... for a target nucleic.apid or its 
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complement. In certain Aspects of the invention, the tertn ''amplicon" refers to the target sequence 
for amplification, or that part of a target sequence that is amplified, and/or the amplification 
products of the target sequence being amplified. In certain other embodiments ain "amplicon'- may 
include the sequence of probes or primers used in amplification. This analysis assists physicians in 
5 ^detecting arid diagnosing ihe disease state and in determining optimal treatment botirses for 
individuals at varying stages of disease state progression. 

In light of the present disclosure, one of ordinary skill in the art will select segriierits from 
the identified marker genes for use in the different detection^ diagnostic, or prognostic ihethods, 
svector constructs, antibody production, kit, and/or any of the embodiments described herein as part 
10 of the present invention. Marker gene sequences include those published in the Genebank database 
^tiiat match the identified marker genes: Genebank Abcessiori riumbdrs D8745 1 , T030 1 3, X03558, 
M28130, and Y00787, as Well as the sequences disclosed herein as SEQ ID n6:1 , SEQ ID NO:2, 
SEQ ID NO:3, SEQ IDN0:4, SEQ ID NO:5, and SEQ ID NO:29, which also irifciude sequences 
for previously unchamcterized marker genes (UCPB 35, SEQ ID NO: 1 ; UC 302, SEQ ID NO:2; 
15 UC 331, SEQ ID NO:29) identified herein. For example, in certain embodiments in which one 
nuiy be practicing the present invention for the identification of a disease marker, for example, the 
sequences selected to design probes and primers may include repetitive stretches of adenine 
nucleotides ^ly^tail^^ 

In certain other embodiments, probes ^irid primers may be ^febific^ly d^i^ed to nbt iiicltide these 
20 or other ^griiehts frorii the identified tfiaikCT genes, as brie of ordiiii^ sMllediri • 
certain^ 

For exariiple, \Vhere ai gfenoiftic sequence is disclosed, one would use seqiiences that 

metastatic ckneer marker includes alternately spliced tihanscripts so that intrdnic sequences iiiay be v . 
25 used for diagnostic or prognostic purposes (Genebahk Accession # M28130). Extoti ^qnituses in V 
the gene Structure, as de^feirTthe^ 

4 1482 to 1647^24^ to 2599, 2fc?l to 2954, ^d 3370 to423& jitfftm 3 includes bases 2954 to 
3370. One of ordinary skill in thw art tiiay^tect sejgn^rits ftoin ^e published exon sequences, or 

30 ^ e 
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which intron 3 is missing (Genebank Accession # Y00787) may be used. By choosing or selecting 
the sequences to include or exclude intron 3, one could preferentially detect expression of one 
alternatively spliced form -.of IL-8, or even the ratio of the two forms using the methods disclosed 
herein. One of ordinary skill in the art may select and/or assemble segments from any of the 
5 identified marker gene sequences into other useful forms, such as coding segment reconstruction's 
of mRN A sequences from published genomic sequences of the identified marker genes, as part of 
the present invention. Such assembled sequences woul4 be useful in designing probes and primers, 
as well as, providing coding segments for protein translation, for detection, diagnosis, and prognosis 
embodiments of the invention described herein. 

10 ... For example, primers to detect the message of 1L 8 using the transcribed portions of the 

marker sequence as set forth in the listing in Genebank Accession # M28130 may hybridize to 
nucleotides 1482 tp 1503 and the complement of nucleotides 1626-1647., The^ 
would amplify a segment of message of the marker gene 166 base pairs in length. Primers 
designed to nucleotides 1482 to 1 503 aijd the complement of nucleotides 2464 to 2483 would 

15 „ simplify a segment of message of fee marker gene 186 base pairs long in messages that have the 
intervening intron between nucleotides 1648 to 2463 removed. Thus, one skilled in the art would 
be able to calculate the expected size of traascrit^ed sequences from marker genes identified herein 
• iyhpse sequences are published either as genomic; sequence, mRN A, or cDNA, a$ well as the 
sequences; disclosed herein, taking, into account the differences in size of the products produced 

20 cl^ending^ on . the presence absence of intronic sequences. In preferred embodiments, tlie 
differences in size of amplification products using primers designed tp re^pns flapking bpth sides 
of nitron 3 in the IJJ^S marker gene sequences identified (Gfenebank; Acc^ssipn # Y00787 and # 
: r ; ; f M28 1 3Q) , oan |be used in detection* diaj^psis, and/pr prpgppsis of metastatic cancer. However, 
primers designed to regions of IL-8 secjuences that do not flanlc intron 3, or the other marker genes 

25 > that jdo nptjhave different in intron splieing, pr that prime mRN A or cDNA template sequences, 
\voiild not be expected to produce amplificatioi^products that include intronic segments. 

V p 1 to 20 and the cpn^qtient of nwleptides 

; , 20atg ^0 ofSEQ jp NO;l long. 
- , v ^ nucleotides 1 IS to 138 -;a^;|^-;^^lem<^t of nucleotides 730 to 744pf SEQ 

30 r , . n>1^0^9 would am gene segment 630 base pairs long. Primiers designed ^ 
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to nucleotides '102 to 120 and the complemerit of nucleotides 3J51 to 46 1 6f the IL-8 marker gene 
sequence identified in Genebanlc Accession # Y00787 Would amplify a metastatic marker gene 
segment 302 base pairs long that would be approximately sevenfold less abundant in normal 
patients wheii compared to patients with metastatic prostate canter. Primes can be designed to 
5 Bmplify the transcribed portions of the metastatic cancer markers that would ihcliide any length of " -.; 
nucleotide Segment of the transcribed sequences, up to and httludihg the full length of each marker 
gene message. It is preferred that the amplified segmerits of identified marker genes be an 
amplicoh of at least about 50 to about 500 base pairs in length. It is particiiIMy preferred that the 
amplified segments of identified marker genes be an arnplicdh of at least about 400 to about 415 

10 ^se pmi^ ih length, and/orno longer in length than the amplified segment used! to iiomihlize the 
^quantity of iiifessage feeing amplified in the detection assay s described herein, i^xich assays include 
RNA fingerprinting methods, hpweveir, differential expression maiy be detecited by other means, 
and all such methods would fall Within the scope of the present inyeritioh. The predicted size of the 
amplified metastatic cancer marker gene: segment, calculated by the location of thfe primers relative 

1 5 to the transcribed sequbnde, wotild be used to determine if the detected amplification product is 
indeed the rn^ker geirie beiiig ailiplified. Sequencing the amplified or^d band that matches 

the expected size of the jaitipM and comparison of the band's sequence to the known 

ord^cl^ 

" '^pi^^^'Bc^^^-'^ ? ^ '" ;r; .;;-.' : - 

N 20 The identified markers may also be used to identify and isolate full length 3gehe ^uettces, 

' ^ expi^sioh,1ff6m g^ttnwc huiriw'DN^ libraries. The 

■ ; t^^ 

^ r 1 i 

25 hybridi2atioii H ); ^"(2^x^01 & Carbon ^Overlap HybriaM^ S&eehihg: Isolation and " . 

Gene on Yeast 

H 6^m^6M^ mk 5* H ^12^91^9. tMee^l^^ 

eDNA hybridization probe, nonr^^ 
^•■fi^ allowing 
30 — ^ #S^be i^bgni^ed that ^ ( i; 
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full length genes may be: obtained using the small expressed sequence tags (ESTs) described in this 
disclosure, using technology currently available and described in this disclosure (Sambrook et al 9 
1989; Chinault& Carbon, ! 9?9). Sequences identified and isolated by such means may be useful 
in the detection pf the prostate, marker genes using the detection methods described herein, and are - 
5 part of the invention. 

The ictejqtified markers may be used to identify and isolate cDNA sequences. The EST 
sequences identified in the present disclosure may be used as hybridizationprobes to screen human 
cDNA libraries by conventional techniques. It will be recognized that these techniques would start 
by obtaining a high quality human cDN A library, many of which are readily available from 

1 0 coipmeipial or other sources. The library may be plated on, for example, agarose plates containing 
nutrients, antibiotics arid other conventional ingredients. Individual colonies may then be 
transferred^) nylon or nitrocellulose membranes and the EST probes; hybridized to complementary 
sequences on the membranes. Hybridization may be detected by radioactive or enzyme-linkedtags 
associated with the hybridized probes. Positive colonies may be grown up and sequenced by, for 

15 example, Sanger dideoxynucleotide sequencing or similar methods well known in the art 
Comparison of cloned cI>NA. sequences with known human or animal cDNA or genomic 
sequences may be performed using; computer programs and databases well known in the art, 
? Sequences identified and isolated by such, inea^ may be useful in the detection of the prostate 
disease, or other disease marker genes using the detection methods described herein,, apd are part of 

20 the invention. 

; . In one embodiment 

marker genes are incorporated into e^r^io^ -ypctqr^ a^id expressed as 4b.e <: ^coifed proteins or 

} ill— &fptt • Ceding sequences roay be assembled from amino 

25 \ acid .encoding s^ginente of or to tru^ 

sequence, or to use the cc^ii^g sequences or segments thereof in expression vectp^ in 
r ^ th^ enribjodiro^ to express peptides or proteins of 

Such protein? or peptides are in turn ,used , as antigens for induction of monoclonal or 
30 jx>lycIonaI antibody production. Such antibodies may in turn be used to detect expressed proteins 
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as additional markers for human disease states. Antibody-protein binding may be detected and 
quarititated by a variety of means known in the art, such as labeling with fluorescent or radioactive 
ligarids. 

Certain metastatic m^ker genes disclosed herein (SfeQ ID NO: 1 and Geriebarik accession # : 
5 T0301 3; and SEQ ID NO:2) do not havd reading frames for translation disclosed. However, one of 
■ ^ ^rflihaify skill in the ah may translate tKe identified sequences or segments thereof in the three 
potential Tcadini: frames to obtain peptides or proteins for use in generating antibodies to these 
marker penev Such antibodies fiiay be used to purify the proteins of the marker genes, and the 
^identity bf protein being detected is eonfirmed by peptide sequencing of the protein* Once 
10 confirmed a* bindmg the translation products of the marker genes corresponding to SEQ ID NO:l 
^ 1 krid tJcricbbnk accession ti T03b 13, and/or SEQ ID NO:2, the? antibodies that bind the marker gene 
protein would be useful in detecting, diagriosis, or prognosis of meta&atic cancer. 

An example of an marker gene sequence that would be preferred for translation would be 
iritron 3 of 1L-K (Gcncbank Accession # 1V128136). Peptides of polypeptides that contain amino 
15 acid sequences from thi^ ihtron wotild bfe preferred in the creation of polyclonal or monoclonal 
antibodies that preferentially detect forriis of 1L-8 which include intrbii 3. 

In certain aspects of the present invention the terms immunodetection", -iiiiftiunobinding", 
"in^t^ofcaction" vi immun6histochemicai-', "immunosorbent", and "^^immunoassays" refers 
to melhods that concern binding, purifying > removing, quantifying or Otherwise generally detecting 
20 biological components by obtaining a sample suspected of containing a protein, peptide or 
antibody, and contacting the sample with an antibody or proteiff or peptide in accordance With the 
pfiesiefit inveiiti6n; as the case may be, under "conditions effective to allow the formation of 
x 'im^ In certain preferred aspects of the present invention, one obtaiiis a sample 

suspected of containing a disease state-marker encoded pfofeihj peptide^ 6 
25 antibody, and contactsthe Sample with an antibody or ehcoded protein or peptide* as the case may 
' and then detests or quantifies the anidunt of immune complex formed- under the specific 

^febnffitieiis^ The steps of Various \isefuT ih^^odet^on ttethbd^ have been deseiibed in the 
" scientffi^ e.g., Nkkjairilira al (1987); ^ H 

" ^ ^ Iil another eihbodimerit 1 of the present uiVeht^ 

30 hybridi^tiori probes and primers afe specific for disease, state markers ^omprisijlg^isolatedijueleic 
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acids of a sequence comprising the sequences published in Genebank Accession numbers D87451, 
T03013, X03558, M28130, and Y00787, as well as the sequences disclosed herein as SEQ ID 
NO:l, SEQ ID NO:2, SEQ ID NO:3 ; SEQ ID NO:4 ; SEQ ID NO:5 5 and SEQ ID NO:29. Such 
probes and primers may be of any; length that would specifically hybridize to the identified marker: 
gene sequences and may be at least about 14, about 15, about 16, about 17, about 1 8 about 19 
about 20, about 2 1 , about 22, about 23, about 24^ about 25, about 26, about 27, about 28, about 29, 
about 30, about 35, about 40, about 50, about, 75, about 100, about 150, about 200, about 300, 
about 400, about 500, and in the case of probes, up to the full length of the sequences of the marker 
genes identified herein. Probes may also include additional sequence at their 5' and/or 3* ends so 
that they extend beypnd the target sequence? with which they hybridize. Such primers may be used 
to amplify disease state markers present in a biological sample, such as peripheral human blood. 
Amplification increases the sensitivity of various known techniques for detecting the presence of 
nucleic acid markers for human disease. Probes that hybridize with nucleic acid markers for human 
disease may be detected by conventional labeling methods, such as binding of fluorescent or 
radioactive ligands. The availability of probes and, primers specific for such unique markers 
provides the basis for diagnostic kits identifying disease state progression. 

An embodiment of the present invention encompasses a kit for detecting a disease state in a 
biological sample, comprising pairs of primers for amplifying nucleic acids corresponding to the 
marker genes and container for each of these primers. In another embodiment, th^ invention 
encompasses a kit for detecting a disease state in a biological sample, comprising oligonucleotide 
prol^ ^at bincl with high affinity to markers of the disease state and containers for each of these 
prol?es. I|i |ft further ^nbpd^en^ 

bioloj^^ proteins encoded by fljjB nucleic acid markets 

of the disesase state identified in the present invention. 

In one jbro^ isolated nucleic acid of a sequence ; 

cpmpr^^ SEQ ID NO^, SEQ ID NQ:5, SJEJQ ID t$Q:4, SEQ ID NO:2, SEQ ID 

NQ^9,^^ the sequences; identi%d in t Oep^^ Accession #s D87451v T03Q^, X0355S, 
M28 130, and YO0787. The invention further broadly comprises an isolated nucleic acid of between 
17 and i |;00 bases in length, either identical to or complementary with portipns } of the above 
meotipn^ u^^ ^g probes for 

■'" ■ ■ .' -i 
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human disease markers, Or may be used to design probes and primers specific for disease state 
' inarkers. The invention further broadly comprises an isolated nucleic acid of between 17 bases to 
the ftill sequence length, either identical to or boittplemehtaiy with portions of the above rnentiohed 
isolated nucleic acids. 

5 & In another broad aspect, the present invention comprises proteins and peptides with amino 
*Bcid sequeiice^ encoded by the aforementioned isolated nucleic acids. The proteins and peptides 
may be directly detected in the practice of the invention, or used for antibody production. 

The invention also broadly compnses methcxds for identifying biomarkers for use in 

..8;. . ....... .. ....... 

^prbgiidstifc or diagnostic assays of a disease State, using the technique of RNA fingerprinting to 

10 Identify RjSlAs that are differeritially Expressed Ibety^ebn individuals \vith the disease state verstts 
%6riiiaT individuals. hi tKe practice of the method; one may' use random hexamers, arbitrarily 
chosen oligonucleotides, proihiscuous oligonucleotide primers or anchoring primers, as well as 
Oligonucleotide primers specific for known, gene sequences for the reverse transcription step 
and/or for the amplification step. The term ' -promiscuous bligoftticleotide primers" as used 

1 5 herein denotes oligonucleotides that are statistically designed to sample sequence complexity in 
mRNAs, or open reading frames of mRNAs without bias as applied in a PCR based RNA 
fihgerprintihg technique. T^ because such use 

increases the sampling fate of RNA for fing^rintittg by increasing the displayed fingerprint 
complexity. This increases the rate at which differentially expressed mRNAs can be discovered. 

20' The use of promiscuous oligonucleotide primers as disclosed herein will be evident to one of 
""^laaify ' irt .flbte art in fight of the publfcatibn by l^opez^ietb and Nigani, Nature Biotechnology 

- \ b^^tem^me). In certain embodiments 

- " ^ priineirs of riandom or 
l ' ^IS^i^Mbm nucleotide sequence 5 ^fabbuPtf bases in length, though in cert^n embbdimfents the 

25 length of the primers may be of any length previously described for "pmtib&& 9 . In certMh aspects 
f • ^ pnnifers that are selected at the 

^<o^r e^^ to pimeirs as described by 

^ bligtfhubfeotide ^equfehtees ^signed to 

30 bind to specific genes, IL-8 or PSA for example, may also be used in the practice of this method. 
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The present invention may be described in a broad aspect as a method for identifying 
serological markers for a human disease state. The method comprises the steps of providing 
human peripheral blood mRNAs; amplifying the mRNAs to provide nucleic acid amplification 
products; separating the nucleic acid amplification products; and identifying those mRNAs that 
5 arc differentially expressed between normal individuals and individuals exhibiting a disease 
state. The described method may also comprise, in certain embodiments, the step of converting 
the RNAs into cDNAs using reverse transcriptase to detect and quantitate circulating cells 
induced b> the disease slate. In certain embodiments of the invention conversion of RNA into 
cDNAs using Reverse transcriptase is referred to as a "reverse transcriptase*' reaction. Methods 
10 of reverse transcribing RNA into cDN A are well known and described in Sambrook et al 9 1989. 
Alternative methods for reverse transcription utilize thermostable* RNA-dependcnt DNA 
polymerases. These methods are described in WO 90/07641, filed December 21, 1990, 
incorporated herein by reference. In certain other embodiments of the invention a "reverse 
transcriptase^ reaction refers to additional steps of amplification of the RNA template or its 
15 cDNA product. Such step of amplification may include any methods known in the art of 
increasing the number of copies of RNA or DNA, as well as the methods described herein. 
Methods of amplification include the methods described in Davey et aL, EPA No. 329 822 
; ? (incorporated herein by reference in its entirely) , as well as polymerase chain reaction or ligase 
& , chain reaction •■ . . . 

20 The method described in the previous paragraph may be used to discover disease markers 

\ . for any disease -state that affects fee peripheral blood lymphocytes. Such diseases include, but 
■ ^ metastatic or organ defined cancer^ particularly metastatic prostate or breast 

; i multiple ?de*Q«s, myasthenias gravis, 

; aUtoimmune-fe lateral sclerosis (ALS or Lou Gehrig's disuse), interstitial 

25 /:rJ:&y.^$s. or plx>Statitis^ V..---V-...-'- ,.>,..■.;-;.'.■-■' ... ; ( , : ; ^ ;r; -, ;/ ^ r . ' : ' 

b^dec^ed to b acid of a sequence com^ NO:l, \. 

SEQID|^:3i^ 
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' id^fied in Genebarik Accession #s I>»745i ;'TO and Y00787, thereby 

measuring the amounts o5f nucleic acid ainplifibation products formed. 

The invention further broadly comprises' the prbignoisis and/or diagnosis 6f a disease state by 
measuring the amounts of nucleic acid amplification products forined. The amounts of nucleic : 
5 amplifi^ an individual patient may be compared with groups of normal - ^ 

^ndividiials 6r individuals with an identified disease state; Diagnosis may be accomplished by 
finding that the patient's levels of disease state mar^^ 

range observed ih individuals with the disease state. Further comparison with groups of individuals 
Wf Varying disease state progression, subh as metastatic vs; non-metastatic cancer, may provide a 

10 pfogriosis for the individual patient: The invbnti6h'furthCT broadly comprises kits for performiUg 
toe aboVe-mentioiiedpro 

The invention may te described therefore, In certMri bfo^ad aspects as a method of 
detecting a human disease stated comprising the steps of detecting the quantity of a disease 
marker expressed in human peripheral blobd and comparing the quantity of the said marker to 

15 the quantity expressed in peripheral bidcyd of a rioritial individual, where a difference in quantity 
of expression is indicative of a disease state. v In the practice of the method the disease marker 
may preferably be afrmRNA, or even an iriRNA amplified by an RNA polymerase reaction, for 
exafnple. The niiRNA may Mso be amplifi^ fey any other means such as rev^ trans6riptase 
polymerase chain reacti6n oir the lig^fe chaih reaction. The RNA may be detected by ariy means 

20 kuovvn in the art, such as by RNA fiiigerprinting, bi^iiched I>NA 6r a nuclease protection assay, 
Tbt exafiiple. Disease states that may be detected by the Resent inethod include any digfeaster state 
for^^^ 

cancer, asthma, lupus erythromatosis, riieii^ 
25 Tn^ 

1 ^ tl?e sequences or the edmpleiheuts 6f the traWsdfibed sequdnce^ disdosed Mrdn as SEQ ID 

:S ^^0^.; ^q t&Wm, g££fti> nOi5 9 "M^WW^^^SBQ ID **N©:2, W$mtf*M9 aind the 

Of Q&l^^ aproduct of ^ • 
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. The method of detecting a disease state described in the previous paragraphs may further 
comprise the steps of providing primers that selectively amplify the disease state marker, 
amplifying the nucleic acid with said prunes to form nucleic acid amplification products, 
detecting the nucleic acid amplification produpts and measuring the amount of the nucleic acid 
5 amplification products formed. In the practice of certain embodiments of the method, the 
primersf may be selected to specifically amplify a nucleic acid haying a sequence comprising 
SEQ ID NO: 1 , SEQ ID NO;3, SEQ ID NO:5^SEQ ID NO:4,SEQ ID NO:2, SEQ ID NO:29 and 
the sequences identified in Genebank Accession #s D87451, TQ3Q13, X03558, M28130, and 
Y00787. In certain alternate embodiments, the marker may be a polypeptide, and may even be a 
1 0 polypeptide encoded by a nucleic acid sequence comprising a sequence disclosed herein a$ SEQ 
ID NO: 1, SEQ ID N0.3, SEQ ID NO:S, SEQ ID NO:4, SEQ ID NO:2, SEQ ID NO:29 and the 
sequences identified in Genebank Accession #s D87451, T03013, X03558, M2813Q, and Y00787, 
v or it may be described in certain embodiments as a polypeptide encoded by the IL-8 gene. 
Detection of the disease state may be by detection of an antibody immunoreactive with said 
15 marker. It is also an embodiment of the invention that detection may be , by a cellular bioassay, 
that responds to the presence of a biologically active agent such as IL-8, for example* In certain 
embodiments of the present invention a "bioassay" is any assay that measures or detects the 
. .presence of ; a compdxind or efiFector, such as a protein, polypeptide, or peptide product of an 
expressed marker gene, by its affect on a cell, orgasm, or biologically derived reagent or 
20 detection system. Bioassays that r^y be used in the present invention, include, but are not 
limited to, those described in, Schroder et ql s 1990 and Yoshimura et al„ 1989, Kui^owska et al., 
;1997,;H reference),, and all bioassay s known in the 

• ^art that can^be used to detect the exprjeiss^ markers. / 

The present invention broadly comprises production of antibodies specific for proteins or 
25 peptides encoded by SEQID NO: 1 ,SEQ II> NO:3, SEQ ID NO^SEQ 1D NQ:4, SEQ ID NO:2, 
^QH>NO:2^ 

. : n\ Kfl2813% and X0O787, and the use of those antibodies for diagnostic applioajdonsio detecting and 
dia^osing^ 

may be quantitated by conventional xn^thpd^.: eorrplatiop of protein levels \yi A of a 
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' human disease or the progression of a human disease may be accomplished as described above for 
nucleic acid markers of human disease. 

Another broad aspect of the present invention comprises the detection arid diagnosis of 
disease states, including BPH and prostate cancer, by combining measurement of levels of two or 
5 more disease state markers. A broad embodiment of the invention comprises combining 
nieasur^ment of serum IL-8 gene product with other markers of prostate disease, such as PSA, 
PAP, HK2, PSP 94 and PSMA. Yet another broad aspect of the present invention comprises kits for 
detection and measurement of the levels of two or more disease state markers in biological samples, 
¥he skilled practitioner will realize that such kits may incorporate a variety of methodologies for 
10 detection and measurement of disease state markers, including but riot limited to oligonucleotide 
probes, primers for nucleic acid amplification, antibodies which bind specifically to protein 
products of disease state mairker genes, and other proteins or peptides %hich bind specifically to 
disease state marker gene products. 

15 BMEF DESCIUPTION OF THE PI^V^GS 

specification and arQ included to further 
demonstrate certain aspects of the present invention. The invention may be better understood by 
reference to one or more of these drawings in combination with the detailed description^ of 
20 s^ presented herein. 

: ., r ^ FIG, 1 A. Relative quantitative RT-?CR of UC Bands #325-1 (intron 3?) wd 325-2 
(Jntrpn 3+) shpws that the normally spliced form of IL-8 mRNA (intron 3-) is abundantly 
expressed in individuals with metastatic prostate cancer (M) compared with normal individuals 
25 &Q8)i The amplification reactions were Sampled at different cycle numbers. The alternatively 
spliced form oifIL-8 mRNA (in^on 3+) is more abundant in notm^ 
with metastatic cancer. The data were normalized against B-actin inRNA. 

FIG. IB. Relative quantitative RT-PCR of UC Bands #325-1 (intron 3-) and 325-2 
30 (intron 3+) shows that the normally spliced form of IL-8 mRNA (intron 3-) is abundantly 
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expressed in individuals with metastatic prostate cancer (M) compared with a pool of normal 
individuals (N)- The alternatively spliced form of IL-8 mRNA (intron 3+) is more abundant in 
norma! individuals than in patients with metastatic cancer. The data were normalized against 6- 
actinihRNA. 

: FIG. 2, Ability of total PSA (t-PS A) to distinguish BPHatid Stages A, B, & C prostate 
cancer. 

FIG. 3. Ability of corrected free/total PSA (fit PSA) ratio to distinguish BPH and Stages 

A, B> & C prostate cancer. 

FIG, 4. Ability of UC325 (IL-8) to distinguish BPH and Stages A, B, & C prostate 
cancer. : 

FIG. 5. AbiU^;of UC325 (1L^8) and t-PS A combined to distinguish BPH and Stages A, 

B, & C prostate cancer. 

FIG. 6: Ability of Ud3^ to distinguish BPH and 

Stages A, B, & C prostate cancer. "V, 

FIG. 7. Relative quantitative RT-PCR™ showing that UG331 mRNA is^tigMy^^ 
times more abundant in the peripheral blood of individuals with recurrent metastatic breast or 
prostate cancer compared niRNA levels from hedthy^olunieers. PGR™ amplification 

of a UC3 3 1 specific cDNA fragment was performed using the same pools of fcactin normalized 
cDNAs as templates. PGR!™ reactions were tei^h^t(^ aAer either 15, or 3 1 cyH». J?odls of 
ct)NAs were constructed from peri^e^ blood RNAs from eight h&ihJiy vbluhf eers (N), tea 
individuals with recurrent metastatic prostate cancer (P), or ten individuals with i^iirieiit ? 
metastatic breast cancer (B). iWintemi^bf tfre^ prop^ori^ to therelativb ambunts 

of UC33 1 mRNA in the individuals from which these cDNA pools were constructed. 
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FIG. 8. PGR™ amplification of a UG332 specific ebNA fragment using 
of hofmalifced cDNAs as templates. PCR™ reactions were terminated after either 25, 28 or 3 1 
cycles. Pools of cDNAs were constructed from peripheral blood RN As from eight healthy 
volunteers (N), ten individuals with recurrent (P), or tein individuals 

£ witti recurrent niertastatic breast cancer (B). The ihterisity of the bands are proportional to the 
relative amounts of UC332 mfcNA in the ibdividimls firbm wlticti i these cDNA pools were 



cdhstructed. 



10 



Terms used: . . * ■ 

HK2: , human kallekrein 2 gene product , 
PAP: prostatic acid phosphatase 

PSA: prostate spepific antigen 



15 PSMA: ,, 



94 

t-PSA:; . ■ • , _ tbtal PSA 



f/t (Free/Total PgA): ratio of free to total PSA, measured in serum specimens witfc moderately 

elevated t-rPSA ^ ^ . 

20.:-^^^r v -r ; ;-^ r ^-,* * ~ Interleukin-8XUC 325) , , , - ;> ,-4 ^r«:. t - . ' - 

v , : vS^§J^ny^TY =^(Wme Positiyes/(Tnie Positives -ft F^se JN€®^ves)i ^lq^ on ^flDC^ of ROC 
• -■• -f/i4'">sii * curve, { . v i . % ■ v . . ; ■ . > , 

n - ^ Negatiyes)/(True Negatives + Fal$e Positiyes); plqtted on x^?cis (as 7- 

■■'' •- ?f V >"-' '...:^rv^x..^.^v.v % ,-.-. %eci/?cio^^ ..,:, ; ;-.v,. = ,.. 

25^-.- — ; CtiEy^; a means of plot^g r ^j^iti\aty aind 

specificity overara^ 
; ^ : benign prostate hyperpl^a.(orhyF^moi)hy) 

' & - -.mt r££lfcvusa ^ - * • t^.. aden'ck^cmoma-pf.tibe.--.prostate.. .^vi^,. . " 4 . • •,. 

St^eA CaP: organ-confined clinical stage of prostate cancar in which tumor is not 

30 palpable by a digital rectal exam (DRE) (Walsh & Worthington, 1995). 
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Stage B CaP: organ-confined clinical stage of prostate cancer in which tumor is palpable 

by a digital rectal exam and involves one or both lobes of the gland 

(Walsh& Worthingion, 1995). 
Stage C CaP: non-organ-confined clinical stage of prostate cancer in which tumor is 

palpable by a DRE and invades beyond the capsule and/or the seminal 

vesicles (Walsh & Worthington, 1995). 
Stage D CaP: non-organ-confined clinical stage of prostate cancer characterized by 

metastases to lymph nodes, bone or other distant organ site (Walsh & 

Wprthington, 1995). 

The present invention concerns the early detection, diagnosis, and prognosis of human 
disease states. Markers of a disease state, in the form of isolated nucleic acids of specified 
sequences from the peripheral blood of individuals with the disease state, are disclosed. These 
markers are indicators of the disease state and aire diagnostic for the presence of the disease state in 
patients. Such markers provide considerable advantages over the prior art in this field. Since they 
are detected in peripheral blood samples, it is not necessary to suspect that an individual exhibits 
the disease state before a sample may be taken. The detection methods disclosed are thus suitable 
for widespread screening of asymptomatic individuals. Further, the methods provide for sensitive 
detection of disease state markers that is relatively unaffected by the pteserice of normal, non- 
diseased cells in a biological sample such as peripheral blood. 

It will be apparent that the nucleic acid sequences disclosed will find utility in a variety of 
applications in disease state detecto, diagnosis, j^^dsfe treatment; M^>1^ of such 
applications within the scope of the present disclosure comprise amplification of markers of the 
disease state using specific primers, detection of markers of the disease state by hybridisation with 
oligonucleotide probes, incorporation of isolated nucleic acids into Vectors, expression of vector- 
incorporated nucleic acids as liNA arid protein, arid development of immunologic reagents 
coirespondingtom^ker 

It is important to iiotfet th^t l)€r$2$ ? {$U8) serology in combination with PSA afrc££ft PSA 
can rpore accurately differentially diagnose jprostate c^cer and fePH. This method provides 
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si^ificaht advantages over ptevious methodologies for detecting: prostatic cancer, which often 
failed to differentiate between prostatic cancer and BPH. 

A. Nucleic Acids ' ^ ' ' 

5 As described in bxamplcs 1 through 4, the present disclosure provides five markers of a 

disease state, identified by RNA fingerprinting. Thdsfe include two previously previously 
urichafacterized gene products, as well as nubleic acid products of the IL-* (interieukiri 8) and 
human elongation factor I -alpha genes. 

In one embodiment, the sequences of isolated nucleic acids disclosed herein find utility as 
10 hybridization probes or ampfi lication • primefsJ These nucleic acids may be used, for example, in 
^tagriO^c ev^luatioiii of tissue samples or employed to clone full length cDN As of geriorriic clones 
Corresponding thereto - In certain embodiments, these pirobes and primers compri^ oligonucleotide 
fragments. Such fragments are of sufficient length to provide specific hybridization to an RNA or 
DNA sample extracted from tissue. Thd sequences typically will be 10-20 nucleotides, but may be 
15 longer. Longer sequences, eg., 40, 50, 1 00, 500 and even up to fiill length, are preferred for certain 
embodiments. r 

Nudieic ^id fe^ 15/17, 20, 30, 40, 50, 60, 

75 or 1 00 Or 500 f nucleotides of a sequence comprising Genebarik Accession numbers D87451, 
t03OI^Xti3^^ 

20 SEQ IB NO:5, or SEQ ID NO 29 are contemplated. Molecules? that are complementary to the 
above mentioned sequences arid that bind to tiiese sequences undet high" stringency cofrdkioias are 
also contemplated. These probes are usef#m&%irife^ as 
^"Sbutherii afid northern blotting In some c^es, it is coi^m^ used that 

* hybridize to multiple target secperifces without ^ 
25 ttte fc^sestete; * - ^' ' M v 

" 1 ^V*tfi6tis j>krtSes? and primers may be d<^gfoe<i arourid the disclosed nucleotide sequences. 

but, ^ically, are l#a0 bases iii length. By assigning iumeric 
values to a ^equenofe, for Example, the first rilsidue is 1^ the second residufcis % vtc^> ^^gorithm 
r K: defining ^primfers may be proposed: : * •••« ■ . ; s - ^ 

30 ntoh + y 
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where n is an integer from 1 to the last number of the sequence and y is the length of the primer 

minus one (9 to 19), where n + y does not exceed the last number of the sequence. Thus, for a 10- 

mer, the probes correspond to bases 1 to 10, 2 to 11, 3 to 12 ... and so on. For a 15-mer, the probes 

correspond to bases 1 to 15, 2 to 16, 3 to 17 ... and so on. For a 20-mer, the probes correspond to 

bases 1 to 20, 2 to 21, 3 to 22 ... and so on. 

The values of n in the algorithm above for each of the nucleic acid sequences is: SEQ ID 
NO:l,n=-253; SEQ ID NO:2,n= 183; SEQ m NO:3, n= 387; SEQIDNO:4,n= 366; SEQ ID 
NO:5,n=598. f ^ 

In certain embodiments, it is contemplated that multiple probes may be used for 
hybridization to a single sample. For example, an alternatively spliced form of IL-8 mRNA, w 
containing intrqn 3, may be detected by probing: human tissue samples with oligonucleotides 
specific for intron 3 and for exon portions of the IL-8 transcript. Hybridization wtilrtbe intron 3 
and exon sequences probe would be indieatiy&of a normal individual and binding to only the exon 
probe would be indicative of metastatic prostate cancer. 

The use of a hybridization probe of between 17 and 100 nucleotides in length allows the 
formation of a duplex molecule that is both stable and selective. Molecules having complementary 
sequences over stretches greater than 20 bases in length are generally preferred in order to increase 
stability, and selectivity of the hybrid* ^4 improve Ae qud^ of hybrid 

molecules. It is: generally preferred to d^i^ nuqlerc acid to 30 

nucleotides^ ©r eyen longer. Such fragments may be readily prepared l^y, for example, directly > 
u$0*^sizing Jhe- fragment t>y <*jemi^..m by introducing selected sequences into 

- The coniplgn^ent of a nucleic acid sequence is well known in tlje art and js, based on the 
3$iiti~par^ of nucleotides (bases) for a -ghfen ni^ljeic aeid tpcjlymer 

(strand). Two complementary strands of DNA are formed into a duplex by.j^airin^ pf /bases, 
^G^lp^^ ofRNA) and all 

<OT "13^ tp 7A^h reverse 5*- 

i^^oopJcment^' defines a Second str^d of^ of 
nucleic acid to form a duplex molecule in which base pairs, are matched .as ^ G^ : C^jA:T/U or 
TVU:A. 
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A compfembnt may also be described iis a fragment of t)N A (nucleic acid segment) 1 or a 
synthesized single stranded oligomer that may contain small mismatches or gaps when 
hybridized to its complement, but that is able to hybridize to the complementary Dlsf A under 
high stringency conditions. To hybridize is understood to mean the forming of a double stranded 
5 riiolecuic or a molecule with partial double stranded nature. High stringency conditions are those 
that allow hybridization between two homologous nucleic acid sequeiices, but precludes 
hybridization of random sequences. For example, hybridization at low temperature and/or high 
ionic strength is termed low stringency. Hybridization at high temperature and/or low ionic 
strength is termed high stringency. Low stringency is generally performed ait 0.15 M to 0.9 M 

10 NaCl at a temperature range of 20°C to 50°C. High stringency is; generally performed at 0.02 M 
to 0 AS M NaCl at a temperature range of 50°C to 70 b C. It is understood that the temperature 
and ionic strength of a desired stringency are determined in part by the length of the particular 
probe, the length and base content of the target sequences, and to the presence of formamide, 
tetramethylammonium chloride or other solvents in the hybridization mixture. It is also 

15 understood that these ranges are mentioned by way of example only, and that the desired 
stringency for a particular hybridization reaction is often determined empirically by comparison 
to positive and negative controls. 

Accordingly, the nucleotide sequences of the disclosure may be used for their ability to 
selectively form duplex molecules with complementary stretches of genes or RNAs or to provide 

20 primers for amplification of DNA or RNA from tissues. Depending on the application envisioned* 
it is preferred to employ varying conditions of hybridization to achieve varying degrees of 
selectivity of probe towards target sequence. 

For applications requiring high selectivity, it is preferred to employ relatively stringent 
conditions to form the hybrids. For example, relatively low salt and/or high temperature 

25 Conditions, such as provided by about 0.02 M to about 0.10 M NaCl at temperatures of about 50°C 
to about 70°C. Such high stringency conditions tolerate little* if any, mismatch between the probe 
and the template or target strand, and would be particularly suitable for isolating specific genei or 
detecting specific mRNA transcripts. It is generally appreciated that conditions may be rendered 
more stringent by the addition of increasing amounts of formamide. 
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For certain, applications, for example, substitution of amino acids by site-directed 
mutagenesis, it is appreciated that lower stringency conditions are required. Under these 
conditions, hybridization may occur even though the sequences of probe and target strand are not 
perfectly complementary, but are mismatched at one or more positions. Conditions may be 

5 rendered less stringent by increasing salt concentration and decreasing temperature. For example, a 
medium stringency condition may be provided by about 0.1 to 0.25 M NaCl at temperatures of 
about 37°C to about 55°C, while a low stringency condition may be provided by about 0 1 5 M to 
about 0.9 M salt, at temperatures ranging from about 20°C to about 55°C- Thus, hybridization 
conditions may be readily manipulated depending on the desired results. 

0 The following codon chart may be used, in a site-directed mutagenic scheme, to produce 

nucleic acids encoding the same or slightly different amino acid sequences of a given nucleic add: 
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Amino Acids 



4 : Cysteihe. 
^ Aspiart?c-jaGid 
V4 Glutamic acid 

Phenylalanine 

Glycine 

Histidirie 



Lysine ,; 
Leucine y 
Methionine 



Proline 
Glutamine 



Serine 

Threonine ' : 

: Vialine ':\V]t 
fir'.! Trj^ophan .V 
tyrosine/ . ,< 



TABLE 1 



Codons 



Ala 
Cys 
Asp 
Glu 
Phe 
Gly 
His 
He 



Leu 
Met 

: : Asn 
Pro 
i Gin 



Ser 
Thr 
Val 



•A- 
C 
D 

E '■■ 

G 
H 
I 

K 

L .;• 

M 
N 
P 

Q/ 
s 

T. 



■> ■ 



GCA 
UGC 
GAC 
GAA 
UUC 
GGA 
CAC 
AUA 
AAA 
UUA 

AUG ; 

AAC 
CCA 
CAA 
AGA 
AGC 
•W^CA ^ 
GUA 
UGG 
;i : UAC 



GCC GCG 
UGU 

GAU ■-. 
GAG 

UUU v.-; 
GGC GGG 
CAU : 
AUC AUU 
AAG 

UUG CUA 



GCU 



GGU 



CUC CUG CUU 



AAU 

GCC 

CAG 

AGO.; 

AGU 

ACC 

GUC 



CCG CCU 



CGA 
UCA 
ACX3 
GUG ; 



CGC 
UCC 
ACU 



em/ cqu 

UCQ UCLf: 



V - 



In other embodiments, hyb^i 



0 -i-j '-V 



5, 50 



■ ^ 37?C. Other hybridization conditions utilized may include 

^pjrp|^a|piy 10 n^ T4s-HCl (pIJ 3.3), 50 mM KCl, 1.5 mM MgCfe, at temperatures ranging 



'1 - 
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In certain embodiments, it is preferred to employ isolated nucleic acids of the present 
disclosure in combination with an appropriate means, such as a label, for determining 
hybridization. A wide variety of appropriate indicator means are known in the art, including 
fluorescent, radioactive, enzymatic or othfcr ligands, such as avidin/biotin, which are capable of 
being delected. In preferred embodiments, one may employ a fluorescent label or an enzyme tag 
such as urease, alkaline phosphatase or peroxidase, instead of radioactive or other environmentally 
undesirable reagents. In the case of enzyme tags, colorimetric indicator substrates are known which 
may be employed lo provide a detection means visible to the human eye or spectrophptometrically, 
to identify specific hybridization with complementary nucleic acid-containing samples. 

In general, it is coniemplated that the hybridization probes described herein are useful both 
as reagents in solution hybridization, as in PGR, for detection of expressionof corresponding genes, 
as well as in embodiments employing a solid phase. In embodiments involving a solid phase, the 
test DNA (or RNA) is adsorbed or otherwise affixed to a selected matrix or surface. This fixed, 
single-stranded nucleic acid is then subjected to hybridization with selected probes under selected 
conditions. The selected conditions depend on the particular circumstances based on the particular 
criteria required (depending, for example, on the G+C content, type of target nucleic acid, source of 
nucleic acid, size of hybridization probe, etc.). Following washing of the hybridized surface to 
remove non-sp^cifically bound probe molecules, hybridization is detected, or even quantified, by 
means of the label. 

It is understood that this disclosure is not limited to the particular probes disclosed herein 
and particularly is intended to encompass at least isolated nucleic acids that are hyhridizable to 
nucleic acids comprising the disclosed sequences 6r that are functional sequence analogs of these 
nucleic acids. For example, a nucleic acid of partial sequence may be used- to idetitify a 
structurally-related gene or the full length genomic or cDN A clone from which it is derived. 
Methods for generating cDNA and genomic libraries which^may be used as a target for the above- 
described probes are knoym in the art (Sambrook el dl ,1089). 

For applications in which the nucleic acid segments of the present disclosure are 
ihc^r^rated into vectors, such ^ pl^niidsi fedsmids of vifuseis^ these segments be combined 
wth dthef fc)NA sequences; siich as 1 promoters, poiyadferf^liation signals, restrictifeta enzyme sites, 
multiple cloning sites, other coding segments, and the like; such that their dVeniH lengft niay vary 
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considerably. It is contemplated that a nucleic acid fragment of almost any length may be 
employed, with the total length preferably being limited by the ease of preparation and use in the 
intended recombinant DNAprotoCoL *" ' ^ 4- 

t)N A segments encoding a specific gene rriay be introduced into recOmbiiitotbiost cells for 
5 expressing a specific structural or regulatory protein. Alternatively, through the application of 
'"genCtic engineering techniques, subportioite or derivatives of selected genes ihaiy be employed. 
Upstream regions containing regulatory regions such as proitiotCT regions niaty be isolated and 
subsequently employ ed for expression of the selected gene. 

Where an expression product is to be generated, it is possible for the nucleic add sequence 
1 0 to be varied while retaining the ability to encode the same product Reference to the codon chart, 
provided in Table 1, enables the design of any nucleic acid encoding the same protein oi: peptide 
product 

B* Encoded Proteins 

15 Once the entire coding sequence of a marker-associatedgene has been determined, the gene 

may be inserted iiltb ah appropriate expression system. The gene may be expressed in any number 
of different recombinant DNA expression systems to generate large amounts of the polypeptide 
product, which may then be^urified anci useft to vaccinate animals to generate antisefa which may 
' also be useful in the practice bf the disclosed invention; Fdr fexaitiple,* polyclonal or monoclonal 

20 antibodies may be pr^aiefl that qfecffically bind to the proteinuria 1 

* g©ne. Such -^tibodies may ife inCoiporatid into kits that' may in turn be used for'dete^on and 
diagnosis of the disease state in peripheral blood or other tissue samples. v ' v V;y " • 

* / Examples of expression j^stems known in the art include bacteria such £L coli, yeast 
such as Stibdtir<>mycies cere^ mai^^M^e^f^ssion 

25 ^ #s;tems su6h toi or CHO cSl& 1^ colt 
' **%i<i ^ may ^ expressed br* -alternatively, 

f " *%agitt^^ :: 

In ohe embo^in!i6iit, the gene Sequeiibb ericbding the poly^^ptidfe is ^alyzetf t6 detect 
putative transmembrane sequences. Such sequences are typically very hydrbp&btt^ 
}0 detected by the use; of sequence analysis software, such as Lasergene (DNAstar, Madison, WI). 
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The presence of transmembrane sequences is often deleterious when a recombinant protein is 
synthesized in many expression systems, especially K coli 9 as it leads to the production of insoluble 
aggregates that are difficult to renature into the native conformation of the protein. Deletion of 
transmembrane sequences typically does not significantly alter the conformation of the remaining : 
5 ~ protein structure. 

Moreover, transmembrane sequences, being by definition embedded within a membrane, 
are inaccessible. Antibodies to these sequences will not prove useful for in vivo or in situ studies. 
Deletion of transmembrane-encoding sequences from the genes used for expression may be 
achieved by conventional techniques. For example, restriction enzyme sites may be used to excise 
10 the desired gene fragment, or PCR-type amplification may be use4 to amplify only the desired part 
of the gene. 

In another embodiment, computer sequence analysis is used to determine the location of 
predicted major antigenic determinant epitopes of ;the polypeptide. Software capable of carrying 
out this analysis is readily available commercially. Such software typically uses conventional 
15 algorithms such as the Kyte/Doolittleor Hopp/Woods methods for locating hydrophilic sequences 
which are characteristically foundkon the surface of proteins and are, therefore, likely to act as 
antigenic determinants. ,- 

+ Once rthis analysis is made, polypeptides may be prepared which contain at least the 
essential features of the antigenic determinant and which may be employed in the generation of 
20 antisera against the polypeptide. Minigenes or gene fusions encoding these determinants may be 
c^iis expression vectors b^r conventional methods, for example, using PCR 

cloning methodology. ..., v ..... 

K ^ !; A geii^ OT:gW ^pj^e^coding ajjoljpej^de ma^ k inserted into an epession vector 

by . iPpnyentional subcloning techniques, . ^ ane embodiment, an & cqli expression vector is used 
25 ; , wluchproducesthe r^ombin^tj^lyj^ptid^ fusion protein, allowing rapid affinity purification 
r are the j^ut^ione S- 

transferase system (Phan^ 

© system (101* New Haven* CT), a»d ;^he £*His system (Qiagen, 

.. r; . -, v ;/v, ;;;: ,,-. : . .. - vS;-:';v^W 
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Some of these systems produce recombinant pblypeptides bearing orily a small number of 
additional amino acids, which are unlikely to affect the antigenic character of the recombinant 
polypeptide. For example, both the FLAG system and the 6xHis system add only short sequences, 
both of which are known to be poorly antigenic arid which do riot adversely affect folding of the 
5 polypeptide to its native conformation. Other fusion systems produce polypeptide where it is 
^desirable to excise the fusion partner from the desired polypeptide. In brie embodiment, the fusion 
partner is linked to the recombinant polypeptide by a peptide sequfettce containing a specific 
recognition sequence for a protease. Exairiples of suitable sequences are tho^e recognized by the 
"%^bbacto fetch Virus protease (Life Technologies, Gmtherebiirg, MD) or Falctbr Xa (N ew England 
10 "Biolabs, Beverley, MA). 

In another embodiment, the expression sy^terri used is one driven by the baculovirus 
|K)iyhedron promoter. The: gene encoding the polypeptide riiay be manipulated by conventional 
techniques in ordfer to facilitate cloning into the baculovirus Vector. One bactilovirus Vector is the 
pBlrieBac vector Onvitrogen^ Sorrento, GA). The vector carrying the gene for the polypeptide is 
15 trarisfebted into Spodoptera jhigiperda (Sf9) cells by conVbntioiiail protocols, and the cells are 
criltdfed arid processed to produce the recombinant antigen. See Suriimers e/ A MANUAL OF 
MEtHbDS FOR BACULOVIRLrS VECTORS AND INSECT CELL GtJLTURE 
' W6(&^ Agricultiu^Ekpenirierita Station; U.S. Patfent No. 4,215,051 (incorporated 

by reference). 

. • - > .. ■. .. ■ ■ . 

20 As an alternative to recombinant p6fypepti<le$/ synthetic peptides corresponding to the 

antigenic determinants may be prepared. Such peptides are at least six amino acid residues long, 

and itt&y cbritaM tip to approximately 5(E) 1 residues, which is the aipproxiniate upper length limit of 

" ^^autbiriafed peptide syrithesis machines, Such as those available from AppM6d BiOsystems (Foster 

" HCity,CA) Use of sticb small peptides for vaccination'^ 

25 ^ to M iinM^o as hepatitis B surface ^tigeh, ^ 

^lk>Virte -Mnhm-^im^ kndwnin^art 

^ - t< ^ «nt^dliM^S^ iianiitto agift'^uehcei^^tS of the polypeptide may be prepared 

* yii ^it!^n^'^^a&^ 9 be minor sequeric^Varia^ whfch arise due tfr natural 

Variation within the population or they may be homologues found iri <>thet ^ecies. They iilso may 

30 be sequences which do not occur naturally but which are sufficiently similar that they function 
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similarly and/or elicit an immune response that cross-reacts with natural forms of the polypeptide. 
Sequence variants may be prepared by conventional methods of site-directed mutagenesis such as 
those described above for removing the transmembrane sequence. 

Amino acid sequence variants of the polypeptide may be substitutional, insertiorial or 
5 deletion variants. Deletion variants lack one or more residues of the native protein which are not 
essential for function or immunogenic activity, and are exemplified by the variants lacking a 
transmembrane sequence describe^ above. Another common type of deletion variant is one lacking 
secretory signal sequences or signal sequences directing a protein to bind tp a particular part of a 
cell. An example of the latter sequence is the SH2 domain, which induces protein binding to 

10 phosphotyrosine residues. 

Substitutional variants typically exchange one amino acid fpr another at one or more sites 
within the protein and may be designed to modulate one or more properties pf the polypeptide, such 
as stability against proteolytic cleavage. Substitutions preferably are conservative, that is, one 
amino, acid is replaced with another of similar shape and charge. Conservative substitutions are 

15 well known in the art and include, for example, the changes of: alanine to serine; arginine to lysine; 
asparagine to glutamine or histidine; aspartate to glutamate; cysteine to serine; glutamine to 
asparagine; glutamate to aspartate; histidine to asparagine or glutamine; isoleucine to leucine or 
valine; leucine to valine pr isoleucine; lysine to arginine or glutamine; metMoi^e to leucine or 
isoleucine; phenylalanine to tyrosine, leucine or methionine; serine to threonine; threonine to 

20 serine- tryptophan to tyrosine; tyrosine to tryptophan or phenylalanine; and v^ine to isoleucine or.. 
J^pijoe.^ ■ "" ■"- ' v... •- A./: - ' • - 

the^lypeptide and also may include hybrid proteins containing sequences from pthear iiomologoxis 
i protems md polypeptides. For example, an insertional variant njay includeportions of the amino 
25 \ acid sequence of the • polypeptide from one species, together with portions of the homologous 
PPb^PtidejBrpjm include,|hpse in wlu^^d^cmal 

- = } ; amino: apids are introduced withm th$ coding sequence of the polypeptide. These typically are 
f .. " t^naller insertions than thgv fusion proteins d^^b^d above, and 'are intrpd^c^ ^ or e ? c ^W > * e ' to 
v dismptaprotease cleavage site. 
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In one embodiment, major antigenic determinants of the polypeptide are identified by an 
empirical approach in which portions of the gene encoding the polypeptide are expressed in a 
recombinant host, and the resulting proteins tested for their ability to elicit art immune response. 
For exaniple, PGR may be used to prepare a range df peptides lacking successively longer 
5 ^fragments of the C-tenninus of the protein. Thb 1 immunoprbtective activity of dach 6f these ' 
^peptides then identifies those fragments or dbrfiaihs of the polypeptide which are essential for this 
activity . Further studies in which only a small nuriiber of amino acids are removed at each iteration 
then enables the location of the antigenic determinants of the polypeptide. 
^ Another embodiment for the preparation of polypeptides according to the disclosure is the 

. ...... - . : . , ....... 1 . .< ■ . > 

10 use of peptide mimetics. Mimetics are peptide^ritaining mble^ mimic elements of 

"^roteih secondary structure. See, for example, Johnson c/ 'T^ptide Turn Mimetics" in 
BIOTECHNOLOGY AND PHARMACY, Pezzuto et a/.;! Edsfc, Chapman and Hkll; New York 
1 (1 993). The underlying rationale behind "the use of peptide mimetics is that the peptide backbone of 
prbteins exists chiefly to orient amino acid side chiains in such a way as to facilitate molecular 
15 interactions, such as those of antibody and antigen. A peptide mimetic is expected to permit • 
molecular interactions similar to the natural mdlecule. 

Successful applications of the peptide mi^^ on mimetics 

of fJ-tums within proteins, which are known to be highly antigenic. Likely p-turn stocture within 
an polypeptide may be predicted by computer-based Algorithms as discussed above. Once the 
20 component amino acids of the turn aire determined, peptide mimetics^may be contracted to achieve \ 
a similar spatial orientation of the essential elementi df the amind acid side chains. 

25 The cDNAs df %^uto^ 

%0S6l3: X035S&, M28130, Y00787, SEQ WNOtf;smtWM®a 9 Sm ID NO:3, SEQ ID NO:4, 
7 ^%Mp^ £ M:^l &r -iSlBQ* n> NOif9%i^y bte^cptess^^ ertcoded p^3e$ or proteiris. The 
' • engineering of DNX e^ressidtt r ?n ai pr#fcaryotic or eukaryotic system snay be, 

^erforii^d^ 
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virtually any expression system may be employed in the expression of the claimed isolated nucleic 
acids. 

Both cDNA arid genomic sequences are suitable for eukaryotic expression, as the host cell 
: generally processes the genomic transcripts to yield functional mRNA for translation into protein^ 
5 In addition, it is possible to use partial sequences for generation of antibodies against discrete 
portions of a gene product, even when the entire sequence of that gene product remairis unknown, 
o Computer programs are available to aid in the, selection of regions which haye potential 
immunologic significance. Software capable of canning out this analysis is. readily available 
commercially, for example MacVector (IBI, New Haven, CT). The software typically uses 
10 conventional algorithms such as the Kyte/Doolittle or Hopp/Wpods methods for locating 
hydrophilic sequences which are characteristically found on the surface of proteins and are 
therefore; likely to act as antigenic determinants. 
'■:>) It may be more convenient to employ as the recombinant gene a cDNA version of the gene. 

It is believed that the use of a cDNA version provides advantages in that the size of the gene is 
15 generally much smaller and more readily employed to transfect the targeted cell than a genomic 
gene, which is typically up to an order of magnitude larger than the cDNA gene. However, the 
possibility of employing a genomic version of a particular gene or fragments thereof is specifically 
i ; contemplated. * ■ - , . 

As used herein, the terms "engmeered" and "recombinant" cells are intended to refer to a 
20 cell into which an exogenous IM^A segment or gene,,such as a qDNA or genehas been introduced. 
Therefore, engineered cells are distinguish^ 

a recombinantly introduced exogenous DNA segment or gene. Engineered cells are thus cells 
having a gene or genes introduced through tbe hand ^cdf man, ReTOmb^^ those 
having an introduced cDNA or genomic gepe,, arid alsp include genes positioned adjacent to a 
25i If.?;, promoternot Q$£!ii^ gene, r. %'< 

'\&y}< j-V ^ wild-type, in 

; ; -{To teing a eo#ng se^^^^ promoter, or tq ^Q|crat^eiy |inlc M to a 

30 promoter, one positions the 5' end of the transcription initiation site of the transcriptional reading 
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frame generally between about 1 arid abotot 50 

promoter. The "upstream" promoter stimulates transcription of the DNA and promotes expression 
of the Encoded recombinant protein. This is the meaning of "recombinant expression" in this 
contort. 

5 Many conventional techniques arc available to construct expression vectors containing the 

^appropriate nucleic acids and transtriptional/translatfoh^ in order to achieve 

./ protein or peptide: expression in a variety of host-expression systems. Cell types available for 
expression ihchide, but are not I united to, bacteria, such as £ coli and B: subtilis trarisfdrined with 
recombinant bacteriophage DNA. plasmid DNA or cosmid DNA expression vectors. 
10 Certain examples of prokarybtic hosts are R coli strain RR1, K coli LE392, 2?. coli B, 

±coliX 1776 (ATCC No. 3 1 537) as well as £ coli W31 16 (F-, lambdas, prototrophic, ATCC No. 
27332$); badlli such as Bacillus subtilis; and bther enterobacteriaceae such as Salmonella 
t^himurium, Serrafia marcescens y and various Pseudomonas specie^. 

In general, plasmid vectors containing repiicon and control 'sequences which are derived 
15 from species compatible with the host cell are used in bonnection \vith these hosts: The vector 
brdinarily carries a replication site, as weir as marking' sequences Which are capable of providing 
phehotypic selection in transformed cells. For example, 21 coli is oftefi ^U^ns^rmed using pBR322, 
a plasmid derived from an E coli species. pBR322 contains geftes for ampicillin and tetracycline 
resistance and thus provides easy means for identifying transformed cells. The pBR plaSmid, or 
20 Other microbial plasmid or phage must also contain* or be modified to contain, promoters which 
may be u^ed by the microbial Organism fbr expression of its oWh proteins. 

In addition, phage vectors containing repiicon and control s^querices ^ 
iv^thetobSfr mib^dbrg^OTi may be usedas transforming vectors in connection vWth these hosts, 
tofe^pte;^ 

25 vt)£j^^ A ( - 1 

Furfe usiSil VecibrS include pm vectors (Inbuyee/ d/; 1985^and pGEX VeSctof$,for use 
^ ^ ^ and 
* T ^ep^tidiiW Cleavage. Other suitable fustoft proteins k£ iQio^e with &gaftS^^dafce,^ or 



J, 

WO 98/24935 PCT/US97/22105 

36 '- 

Promoters that are most commonly used in recombinant DNA construction include the |5- 
lactamase (penicillinase), lactose and tryptophan (trp) promoter systems. While these are the most 
commonly used, othp microbial promoters have been discovered and utilized, and details 
concerning their nucleotide sequences have been published, enabling their ligation into plasmid 
5 vectors. 

For expression in the plasmid YRp7, for example, is commonly used. This 

plasmid already contains the trp\ gene which prpvides a selection marker for a mutant strain of 
yeast lacking the ability to grow in tryptophan, for example ATCC No. 44076 or PEP4-1 . The 
presence of the trp\ lesion as a characteristic of the yeast host cell genome then provides an 

10 effective environment for detecting transformation by growth in the absence of tryptophan. 

Suitable promoting sequences ^ in yeast vectors include the promoters for 3- 
phosphoglycerate kinase or other glycolytic enzymes, such as enolase, glyceraldehyde-Srphosphate 
dehydrogenase, hexokinase, pyruvate decarboxylase, pho$phofructokinase, glucose-6-phosphate 
isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase,; 

15 phosphoglucose isomerase, and ^ucokinase. In constructing suitable expression plasmids, the 

......... ... . . ... A 

termination sequences associated with these genes are also ligated into the expression vector 3 1 of 
the sequence desired to he expressed to provide polyadenylation of the mRNA and termination. 

Other suitable promoters, which have the additional advantage of transcriptipn controlled 
ibyJBJcw^ conditions, include the promoter region for alcohol dehydrogenase 2, isocjftoqbarpme C, 

2 ^L-ri-'. ; ^^H ^'P^sp^tos^ \dqgtfdgfii^ mm - ^3!™^??-:; <as^ia*ec| . with nitrogen metobpUatn, and the: 
aforementioned glyce^dehyde-3 r phosph^^ enzymes jesponsible for maltose 

■k?: );><.: if ^s^kicMi tp microorganisms,^ of cells derived ^ 
,,- ... J :.^also: be u^4' as hosts. Jn pjinciple> <any such cell culture is workable, whe&^^ or 
25 invertebrate culture. In additiorrto mammalian cells, these include insect cell ^^e^ i^ected with 

systems infected with 

.^m^ yh^ex^ C^ls/ty^ lob^qc^ mqsaic virus, 

, ni,^ (e,j^,^j^as|m^) ^tainiiig 

one or more coding sequences. V. 
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In a useful insect system, Autographic* californica mxclear polyhidrosis virus (AcNPV) is 
used as a vector to express foreign genes. The virus grows in Spodopterajrugiperda cells. The 
isolated nucleic acid coding sequences ate cloned into nonessential regions (for example the 
pblyhedrin gene) of die virus and placed under control of an AcNPV promoter (for example the 
5 polyhedrin proihbter). Successful insertion of the coding sequences results in the inactivation of 
ifche polyhedrin gene and production of non-occluded recombinant virus (i.e., virus lacking the 
proteinaceous coat coded for by the polyhedrin gene). These recombinant viruses are then used to 
infect Spodoptera frugiperda cells in which the inserted gene is expressed (e.g.; U.S. Patent No. 
^15;051 (Smith)). 

10 • * Examples of useful mammalian host cell lines are VERO and HeLa cells, Chinese hamster 
"ovary (CHO) cell lines, Wl 3 8, BHK, COS-7, 293, HepG2, 3T3, l RIN and MDCK cell lines. In 
Addition, a Host cell strain may be chosen that modulates the expression of the inserted sequences, 
or modifies and processes the gene product in the specific fashion desired. Such modifications 
(e.g., glycosylation) and processing (e.g., cleavage) of protein products may be important for the 

15 function of the encoded protein: 

Different host cells hive characteristic and specific mechanisms for the post-translational 
processing and modification of proteins. Appropriate cells lines or host systems may be chosen to 
help ensure the correct modification and processing of the foreign protein expressed. Expression 
vectors for use ta^ 

20 front of the gene to be expressed, along with any necessary ribosome binding sites, RNA Splice 
sites, polyadeiiy Mti 6n site, and transcriptional terminator sequences. The origin of replication may 
be provided either by construction of the Vector to include an ekbgenbus origin^ sucfa as may be 
^^Vd3 iftoirf SV40 or 6ther viral (e g , 1*01 yoma^ Adteno, V^;BP^ sburce, or may be provided 
by the Host cfell chromosomal replication mechanism. If the Vector i^ integrated into th^ host cell 

25 ^hroihosome, the latter is often sufficient . 

^ ' - The promoters may be derived from the geno'nie o£mammMian cells (e.g., inetaliothionein 
ptomoter) W from mammalian viruses (e.g., the adenovirus late promoter; the Vaccinia virus 7.5K 
prbmdtei); Farther, it is also possible to utilize promoter or control sequences normally associated 
With fhe g£fte sequence df interest, p^ 
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A number of viral based expression systems may be utilized- For example, commonly used 
promoters are derived from polyoma, Adenovirus 2L> and most frequently Simian Virus 40 (SV40). 
The early and late promoters of SV4G virus are particularly useful because both are obtained easily 
from the virus as a fragment which also contains the SV40 viral origin of replication. Smaller or: 
5 larger SV40 fragments ma\ also be used, provided there is included the approximately 250 bp 
sequence extending from the Hind III site toward the Bgl I site located in the viral origin of 
replication. 

In cases where an adenovirus is used as an expression vector, the coding sequences may be 
ligated to an adenovirus transcription/ translation control complex, e.g., the late promoter and 
10 tripartite leader sequence. This chimeric gene may then be inserted in the adenovirus genome by iw 
vitro or in vivo recombination. Insertion in a non-essential region of the viral genome (e.g., region 
El or E3) results in a recombinant virus that is viable and capable of expressing proteins in infected 
hosts. 

Specific initiation signals may also be required for efficient translation of the claimed 
15 isolated nucleic acid coding sequences. These signals include the ATG initiation codon and 
adjacent sequences. Exogenous translational control signals, including the ATG codon, may 
additionally need to be provided. This need is readily determinable and the necessary signals 
readily provided. It is? well known that the initiation codon must be in-irame (or in-ghase) with the 
reading frame of the desired coding sequence to help ensure translation of the entire insert These 
20 exogenous translational control signals and initiation cpdons may be of a variety of origins, both 
natural and synthetic. The efficiency of expression may be enhanced by the inclusion of 
appit^ 1997), 

> ( In eukaryotic expression, it is tj^ic^ly prefcired to incorporate into 
; an appropriate polyad^ site (e.g., 5^AA]TAAA-3 r ) if one was not contained within the 

25 original cloned segment. Typically, the poly A addition site is placed about 30 to 2000 nucleotides 
; ; "do^ynstream" of the termination site of the protein at a position prior to transcript^ 

For : Ipng-tenn, high-^yield production of recombinant proteim, stat>|e; expr^s^on is 
For example, cell lines that stably express constructs encoding proteins - way be 
father than using expression vectors that contam vi^ cells 
30 may be transformed with vectors controlled by appropriate expression control el^nepts (e.g., 
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promoter or enhancer sequences^ transcription terminators, polyadenylatioii sites, etc.); and a 
selectable marker. Following the introduction of foreign DNA, engineered cells may be allowed to 
grow for 1-2 days in an enriched medium and then are switched to a selective media. The 
selectabfe barker in the recombinant plasmid confers resistance to the transformant and allows 

5 cells to stably integrate the plasihid into their chromosomes and grow to form foci which in turn 
n&y be cloned and expanded into' cell lines. 

A nuirtbci «i If selection systems may be u&bd, including^ but not limited, to the herpes 
simplex vims thymidine kinase (Wigler et at, 1977), hyjtox^thine-guanine 
^^ltosphonK>N\ (transferase (Siybalska et aL, 1 962) and adenine phosphoribosyltransferase genes 

10 (Lttwy cir at . 1<>S0). in lit-; hgprt- or^rt- dfells, respectively; Also, antimetabolite resistance may 
Ire u^^ for dhfr, that confers resistance to methotrexate (Wigler £?/ aL, 

1980; O'Hare et at, 1981); gpt, that confers resistance to mycophenolic acid (Mulligan et al y 
1981); nco; that con fcrk resistance to the aminoglycoside G-4 1 8 (Colberre-Garapin et al, 1981); 
and hy gro; that confers resistance to hygromycin. 

15 It is contemplated that tfe is6lMed ftucte 

Le; y expressed in infer cia^dle^I^retatiKreto their natural expression in normal human cells, or even 
relative to the expressibri of other overexpression may 

" be asi&S^ 

V 5 $un£ie md dfr&t niseis are j^efeiT^a, for exaitiple^ffi^ 
20* stkihiri^ or We^terB blotting:, WlloWed by 4^dntitative analyses 
the feiilt^t^^ 

the level in Plural human cells is indicative bf over expresijfdn> as ik a relative 
'MbtiMaiicb of tfie Specific proteitt itt relation to the other proteirts produced by the host cell aiiid, e.g., 
visible on a gel. - ■ ■ ' Y-:<i -■-..^■yO^i -^ryvS-^^r^ 

<fc i-fip "JM j- :/i ■>-,, ipttrtfi^f ^pebts of the preseiit di&losiire Cdiiberii the puirfiiG^tiO&^d in particular 
" ? ' ^il^ liie temi ^purified 

• pr&eitt^^ti^ other 
3tf ? e^p^^ 
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obtainable state, i.e., in this case, relative to its purity within a cell extract. A purified protein or 
peptide therefore also refers to a protein or peptide, free from the environment in which it may 
naturally occur. 

Generally, "purified" refers to a protein or peptide composition which has been subjected to 
5 fractionation to remove various other components, and which composition substantially retains its 
expressed biological activity. Where the term "substantially purified" is used, this refers to a 
composition in which the protein or peptide forms the major component of the composition, such 
as constituting about 50% or more of the proteins in the composition. 

Various methods for quantifying the degree of purification of the protein or peptide aie 
10 known in the art. These include, for example, determining the specific activity of an active 
fraction, or assessing the number of polypeptides within a fraction by SDS/PAGE analysis. A 
preferred method for assessing the purity of a fraction |s to calculate the specific activity of the 
fraction^ to compare it to the specific activity of the initial extract, and to 

of purity, assessed by a "-fold purification number". The actual units used to represent the amount 
15 of activity is dependent upon the particular assay technique chosen to follow the purification and 
whether or not thp expressed protein or peptide exhibits an enzymatic or other activity. 

■■ . . Various techniques suitable for use in protein purification are known in the art These 
include, for example, precipitation with ammonium sulfate, PEG, antibodies and the like or by heat 
x denaturation, followed by c^ntrifugation; chromatography steps such as ion exchange, gel filtration, 
20 reverse phase, hydroxylapatite> and affinity chromatography; isoelectric focusing; gel 
electrophoresis; and combinations of such andother techniques. As is generally known in the art, it 
; i§ b^liey^d that order of conducting the various purification steps may be changed, or that 
; eertafo steps may be omitted, and still result in a suitable method for the preparation of a 
substantially purified protein or peptide. 
25 There is no general requirement that a protein or peptide always be provided irt its most: 

purified state. Indeed, it is contemplated that less substantially purified products have utility in 
; ; certain;CTi^©diments. Parti^ purification ma^ be acwm 

:cpii^inati^i, or by utilizing different forms pf Ae^same geperal purifieatipn scheme. Fpr .example, 
* it? is; appreciated that a cation-exchange column chromatography performed utilizing an HPLC 
30 v r r;^; apparatus generally results in a greater -fold purification than the same technique utilizing a lovtf ; 
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J>ifessure ctoomatbgrapHy system ; fvlettibds exhibiting a lower degree of relative piinficatioh may 
have advantages in total recovery of protein product; or in maintaining the activity of an expressed 
protein. 

It is known that die migration of a polypeptide may vary, sometimes significantly, with 
5 IdiSferent conditions of SDS/PAGE (C^>aldi ei dl, Biochem. Biophys^ Resi Comm., 7(5:425,- 1977) 
l; ^it is therefore appreciated that under differing electrophoresis conditions, the apparent molecular 
weights of purified or p^ally piirified expression products may vary . : 

u \ 21 Antibody 'Generation 
10 ; h For some embodiftiehts, it is preferred to prbdufce antibodies that biiid 

iqpiecificity to the protein pfoduct(s) of ^ isolated nucleic acid of a Sequence compri^g Ciehebank 
" • Accession timbers D87451, T030i 3^ X03558, M2«l 30; Y00787i SEQ ft* NO:lV H>^NO:2, 
" SEQ ID NO:3, SEO ID ^0:4, SEQ ID NO:5, or SEQ ID NO:29. Means for prepaKiig and 
chiracteriang antibodies are well kiio\Vn in the art (See, e.g., Antibodies: A Laboratory Manual, 
15 Cold Spring 

Methods for generating polyclonal Antibodies 5 are well known in the ait. Briefly, a 
polyclonal antibody is prepared by immunizing an animat with an immunogenic composition and 
collecting antisera ^ A wide range of animal species ih&y be used for 

the jtf bdiubtibri of atitiseira, including rabbits, mice, rate, h^^ Because of 

20 the f elati veily large * blood vbltiixie of rabbits, a rabbit is a preferred choice for production of 

' /y ^^ls%ell iaioWh^ is 

often necessary therefore to boost the host immune system, as may b#&^ a 

^ ^ are keyhole 

25 

^mb^ Mc^itei^jb^i^ating 

" <v - ^^y^e^dfe tc> a c^er pmteih &&%elK^^ to ttte fcrt m& ifisiiide glut^iiat&yde, 
m-male*^^ 

-< -:r ^ v ^ -A^n^ -^^^^eB knbS^n m immunogen 

1 i^potise, 
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, known as adjuvants. Exemplary and preferred adjuvants include complete Freund's adjuvant (a 
non-specific stimulator of the immune response containing killed Mycobacterium tuberculosis) y 
incomplete Freund's adjuvants and aluminum hydroxide adjuvant 

The amount of immunogen composition used in the production of polyclonal antibodies 
5 varies with the nature of the immunogen as well as the animal used for immunization. A variety of 
routes: may be used to administer the immunogen (subcutaneous, intramuscular, intradermal, 
intravenous and intraperitoneal). The production of polyclonal antibodies may be monitored by 

sampling blood of the immunized animal at various points following immunization. A second, 

\ 

booster, injection may also be given. The process of boosting and titering is repeated until a 
10 ; suitable titer is achieved. When a desired level of immunogenicity is obtained, the immunized 
animal may be bled and the serum isolated and stored, and/or the animal may be used to generate 
monoclonal antibodies. For production of rabbit polyclonal antibodies, the animal may be bled 
through an ear vein or alternatively by cardiac puncture. The removed bipod is allowed to 
coagulate and then centrifixged to separate serum components from whole cells and blood clots. 
1 5 The serum may be used as is for various applications or else a particular antibody fraction may be 
purified by well-known methods, such as affinity chromatography using another antibody or a 
peptide bound to a solid matrix. 

Monoclonal antibodies (MAbs) may be readily ^ use of well-known 

techniques, such as those exemplified jn U.S. Patent 4,196,265, incorporated herein by reference. 
20 Typically, this technique involves immunizing a suitable animal with a selected inmiunogen 
composition, e.g., a purified or partially purified expressed protein, polj^pjtide or peptide. The 
., in^ manner effective to stimulate 

- : :<?^11$;, as described above. ; \V;. ■ 

Thq methods for generate 

25 - Imes as those for preparing polyclonal antibodies. Rodejo^s such as n*ice and rats are preferred 
ia^uM^ia| % however, the use of rabbit, sheep or frog cells is also po^bie* *nay 
: \.&]^^ but mice are preferre 

ippij^ b^ing most preferred as, ^s gene^ly ? giy^s a higher piercpnt^ge of stebie fosipn?- ^ 
.,, ^ni^ T^amm^ls ^ inj^t^d ; ^ith antigen as described aboye v The antigen may> be cotipled to 
30 ^c^rrier molecules such as keyhole limpet bernocyai^ i£ necessary. The anti^en ; is typically minced 
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^ with adjuvant, such as Freund's complete or incomplete adjuvant. Booster injections with the same 

antigen typically occur at approximately two- week intervals. 

Following immunization, somatic cells with the potential for producing antibodies, 
specifically B lymphocytes (B cells), are selected for use in the MAb generating protocol. These 
5 ^ "cells may be obtained from biopsied spleens, tonsils or lymph nodes, or from a peripheral blood 
^sample. Splben cells and peripheral blood cells are preferred, the former because they are a rich 
source of antibody-producing 

peripheral Wood is easily accessible. Often, & panel of animals are immunized and the spleen of the 
^toimal with the highest antibody titer is removed and the spleen lyitiphocytes obtained by 
10 homogenizing the spleen with a syringe. Typically, a spleen from an immiinized ttiouse contains 
Approximately 5 X 10 7 to 2 X 10 8 lymphocytes: 

The mtibody-producing B lymphocytes from the imriiuhized animal are then fused with 
cells of an immortal myeloma cell, generally one 6f the skrhe spebies as the animal that was 
immunized. Myeloma cell lines suited for use in hybridoma-producing fusion procedures 
15 preferably are non-antibody-producing, have high fusion efficiency, and enzyme deficiencies that 
render then incapable of growing in selective media which support the growth of only the desired 
fused cells (hybridomas). 

Any one of a number of myeloma cells mky be used, as are known in the art (Goding, pp. 
65-66, 1986; Campbell, pp. 75-83, 1 984). For Example, where the ih^tirii:^ 
20 one may use P3-X63/Ag8, X63-Ag8.653, NSMAg 4 1 , ^iWigl4, FO, NS0/U* MPCM 1, ^ 
1^PC11-0C45-GTG 1.7 and S194/5XK0 Bui; for tats, one may use mi 0.R€Y3> Y3-Ag 1-23, . 
^ W83F and 4B21 0; and U-266, GM150^ 

connection with human cell fusions. ' 
• One preferred murine myeloniaceil is the NS-1 myeloma cell line (also termed P3-NS-1- 

25 ' Ag4-1>, whfch is readiiy available fr&m the N1GMS Human Genetic Mutant Cell Repository by 

-requesting cell line repository number GM3573. Another mouse i inyeloina celf^ be ; : > 

^tiSed is the 8-azaguamne-r^S murine myeloma SP2/0 hbn-producer cell line. 

^ Methods for generating and 
' - ^tiiybioina cdlS- usually comprise mixing somatic cells with myeloma edits in a 2:1 proportion, 
30 ^ tltough^ 
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agent or agents (chemical or electrical) that promote the fusion of cell membranes* Fusion methods 
using Sendai virus have been described by Kohler and Milstein (1975; 1976), and those using 
polyethylene glycol (PEG), such as 37% (v/v) PEG, by Gefter et ah (1977). The use of eiectricaiiy 
induced fusion methods is also appropriate (Godingpp. 71 -74, 1 986). 
5 Fusion procedures usually produce viable hybrids at low frequencies, about 1 X 10" 6 to 

I X 10 . However, this does not pose a problem, as the viable, fused hybrids are differentiated 
from the parental, unfused cells (particularly the unfused myeloma cells that would normally 
continue to divide indefinitely) by culturing in a selective medium. The selective medium is 
generally one that contains an agent that blocks the de novo synthesis of nucleotides in the tissue 
10 culture , media. Exemplary and preferred agents are aminopterin, methotrexate^ and azaserine. 
Aminopterin and methotrexate block de novo synthesis of both purines and pyrimidines* whereas 
azaserine blocks only purine synthesis. Where aminopterin or methotrexate is used, the media is 
supplemented with hypoxanthine and thymidine as a source of nucleotides (HAT medium). Where 
azaserine is used, the media is supplemented with hypoxanthine. 
15 . The preferred selection mediumJsHAT. Only cells capable of operating nucleotide salvage 

pathways are able to survive in HAT medium. The myeloma cells are defective in key enzymes of 
the salvage pathway, e.g., hypoxanthine phosphoribosyl transferase (HPRT), and they cannot 
survive. The B cells may operate this pathway, but they have a limited life span in culture and 
generally 4ie within about two weeks. Therefore, the only cells that may survive in the selective 
20 media are those hybrids formed from myeloma and B cells. 

This culturing provides a population of hybridomas from which specific hybridomas ate 
selectedv ; TTypically, selection of hybridomas is performed hy culturing the cells by single-clone 
dilution in microtiter plates, followed by testing the individual clonal supernatants (after about two 
to toee weelcs) for the desired reactivity. The assay should be sensitive, simple and rapid, such as 
25 , radioimmimoassays, enzyme immunoassays, cytotoxicity assays, plaque assays, dot,: 
• ; -.,;^^^i^^!!^hipd^tt^ assays, and th e ; 1 ike* 

The Selected hybridomas /are then : serially ^ dUut^ and clpn^ intp. ^uKjhddual 
ceU lines, which clones may then be propagated indefinitely to provide MAbs. 
^e A sample of the hybridoina 

' may be injected (often into fee peritoneal cavity) into a histocompatible animal of the type that was ; 
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used to provide the somatic and myeloma cells for the original fusion. The injected animal 
developstumors secreting the specific m The 
body fluids of the animal, such as serum or ascites fluid, may then be tapped to provide MAbs in 
high concentration. The individual cell lines may also be cultured in vUro> where the MAbs are 
5 'naturally secreted into the culture medium from which they may be ireaidily bfetiaaned % high 
Concentrations. MAbs produced by either means may be further purified as needed, vising filtration, 
centriftigation and various chromatographic methods such as HPLC or affinity chromatography. 

Large amounts of the monoclonal antibodies of the present disclosure may also be obtained 
by multiplying hybridoma cells in vivo. Cell clones are injected into mammals which are 
1 0 histocompatible with the parent cells, e.g. , syngeneic mice, to cause growth of antibody-producing 
tumors. Optionally, the animals are primed with a hydrocarbon, Especially oils such as pristane 
' (tbtr^ethylpfehtadecane)prior to injection. 

In accordance with the present invention, fragments of hiohoclorial antibodies may be 
obtained by methods which include digestion of monoclonal antibodies with enzymes sucA as 
15 pepsin' or papain and/or cleavage of disulfidfe bonds by chemical reduction. Alternatively, : • 
monoclonal 

ah automated peptide synthesizer. 

The monoclonal conjugates of the present disclosure are prepared by methods known in the 
: art, e.g:> by f^cting a m as described above withj, for instance, an 

20 " enzyme in the presence of a coupling agent such as glutaraldehyde or periodate. Conjugates with 
fluorescein markers are prepared in the presence of these coupling agents cwf by rekdtidn with an 
isothiocyanate. Conjugates with metal chelates are similarly pi^ddufced. GmCT Aoi^ •' " 

%tiMes inay be ^tyugated ihcte l3| I ^P, 35 S, ^C, 5, Cr, M Ck 

<^ & 

25 * iaritib^ 

. .. * ■ • ■ ••«■".' 

r * aeeoni%^ 

hypticMoriie, or Sii em^miatic oxidi^g ag^ii^ such Monoclonal antibodies 

a&ofc^^ teelM^titim? 9 by ligdhd exchange process, for 

30 "* ^ 
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Sephadex column and applying the antibody to this column or by direct labeling techniques, e.g. ■ 
by incubating pertechnate, a reducing agent such as SNCI2, a buffer solution such as sodium- 
potassium phthalate solution, and the antibody. 

It will be appreciated that monoclonal or polyclonal antibodies specific for proteins that are 
5 preferentially expressed in the peripheral blood of individuals with the disease state have utilities in 
several types of applications. Thesis may include the production of diagnostic kits for use in 
detecting or diagnosing the disease state. It will be recognized that such uses are within the scope 
of the present invention. 

10 P* Immunodetection Assays 

L Immunodetection Methods 

In still further embodiments, the present disclosure conceim immimcKletection methods for 
binding:, purifying, removing, quantifying or otherwise generally detecting biological components. 
The encoded proteins or peptides of the present disclosure may bp employed to detect antibodies 
15 having reactivity therewith, or, alternatively, antibodies prepared in accordance with the present 
invention, may be employed to detect the encoded proteinic* peptides* The steps of various useful 
immunodetection methods have been described in the scientific literature, such as, e r g., Nakamura • 
^ .^4 (1987). : . . , . 

^ In general, the immtuiobinding methods include obtsuning a sample suspected of containing 

20 a protein, peptide or antibody, and contacting ttie sample wijft, an, antit^dy or protein or 5 peptide in ( 
accordant witht the present invention, as the case may be * under oonditiops effective to allow the 

7 > / of a 

.reactive component in a , sample, which methods require thg detection or quantitation, of any 
254>^ : ^ a sample ?^u^p^t€Mi of 

-xV?l^P^ encoded protein, peptide: , pr . sl y corresponding j^^bo^ 9 and 

. contacts ti^e s^pl^ witib^ an ^tibody qr ejaQoded^^ and then 

specific coj^oii^> ; 

In terins of ^tigen detection, Ae biological of 
30 - ^peripheral Irfood, However* it may be any, §^mple that is suspected of containing a disease state- 
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specific antigen; such as a lyiripK node tissue 5 section of specimen, a homogenized tissue extract, an 
isolated cell; a ceil membrane preparation- separated or purified forms of any of the above protein- 
Contaihing compositions, or any other biological fluid that comes into contact with diseased tissues,, 
including lymphatic lipid, tirinc and even seminal fluid- - 
5 * Contacting the chosen biological sample ^ or antibody under 

condition^ alloxv the formation of immune "complexes 

primary immune complexes) is generally a matter o^aufeply adding the composition to the sample 
and iiicubatirig the mixture for a period of time 16ng enough for the antibodies to form Immune 
cdinplexes with, i.e., to bind io* any antigbhb 7 presfehl After this time, the sample-antibody 
1 6 composition, such as a tissue section, EE/ISA plate, dot blot or Westerii blot; is generally washed to 
remove any non-specifically bound antibody species, allowing only those antibodies specifically 
bound within the primary irhmiihe complexes to be detected: 

In general, the detecti in the art ^nd may be i 

acbieved through the application of numerous approaches. These methods are generally based 
15 upon the detebtibn 6f a label or m^k^ fluorescent^ biological c* - 

enzymatic tags or labels of conventional vise in the art. U.S. Patents concerning the Use of such 
I^is include 3 4,366,241, 
each incorporated fieteiii by reference: Of c^tfi^e; bne i^ay find additional advantages trough the 

ligarid binding 

20 arrangement-, as is knov^n in the art* " '> v 

'The 6nc6ddd protdin, peptide oi- cbi^e^Onding antibtniy employed in the detection may 
v ' > "- : ^tsSb^%fe v "lltiEk^df r to a detedtable label, therein o^Would-theh sirfiply detect this label, thereby 

AhytiM^ely, ttie first added eompoheAt that becomes b6uHd ttithih the pto^kmiine 
25 cdripfc?^^ 
^ricoM^ 

w ^ Tlie&cdi^ may 

<r " thus be ^ ptiih&y ii^We ;complfex6s aire contacted yrith the 

^ li^t^ sec^^ 
30 sufficient to allow the formation of ^kc&dS^ 
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complexes are then generally washed to remove any non-specifically bound labeled secondary 
antibodies or ligarids, and the remaining label in the secondary immune complexes is then detected. 
Further methods include the detection of primary immune complexes by a two step 
approach. A second binding ligand, such as an aptibody, that has binding affinity for the encoded 
5 protein, peptide or corresponding antibody is used tp form secondary immune complexes, as 
* described above. After washing, the secondary immune complexes are contacted with a third 
£ ; binding ligand or antibody that has binding affinity for the second antibody, again under conditions 
5 : effective and for a period of time sufficient to allow the formation of immune complexes (tertiaiy 
immune complexes). Thethirdlig^dqr antibodyislii^ed to a detectable label, allowing detection 
1 0 ;? • of the tertiaiy immune complexes thus formed. This system may provide for signal amplification if 
this is desired. ,. 

The immunodetection methods of the present disclosure have evident utility in the 
diagnosis of human disease states. A biological or clinical sample suspected of containing either 
the encoded protein or peptide or corresponding antibody is used. However, these embodiments 
15 also have applications to non-clinical samples, such as in the titering of antigen or antibody 
samples, in the selection of hybridomas, and the like. 

In the clinical diagnosis or monitoring of patients with a disease state, the' detection of an 
anfigeri/encoded by a disease state marker nucleic acid, or an increase ih the levels of such an 
antigen, in comparison to the levels in a corresponding biological sample from a normal subject is 
20 indicative of a patient with the disease state. The basis for such diagnostic methods lies, in part* 
with: the finding that the nucleic acid disease state markers identified in the present disclosure are 
? ToPyey^xp^ (see Examples 1 

throu^t t 4 below). -By pension, it may be inferred that at least some of these markers produce 

25 i - > M 

thfc ^* B above which increased 

Jfigi^ high 

30 ; from wthin a tissue that rach give a pps^dye signal. , . £ ,= ...... 
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2: Immuriohistochemistry 

The antibodies of the present disclosure may also be used in conjunction with both 
fresh-frozen and formalin-fixed, paraffin-embedded tissue blocks prepared from. study by* 
5 immunohistochemistry (IHC) or fixed cells on microscope slides for immunocytochemistry. The 
method of preparing tissue blocks from these particulate specimens has been successfully used in 
previous IHC studies of various prognostic factors and is well known to those of skill in the art 
(Brown al 9 1990; Abbondanzo et al y 1990; Allred c/a/., 1990). 

Briefly, frozen^sectionsmay be prepared by rehydrating 50 ng of frozen "pulverized" tissue 
10 at xoorii temperature in phosphate buffered saline (PBS) in small plastic capsules; pelleting the 
particles by centrifiigation; resuspending them in a viscous eibbedding medium (OCT); inverting 
the capsule arid p611etihg again by centrifugatiori; sriap-free23rig in -76- G isopentane; cutting the 
plastic capsule and removing the frozen cylinder of tissuie; securing the tissue cylinder on a cryostat 
micitrtome chuck; and cutting 25-50 serial sections containing ari average of about 500 intact cells. 
15 r Peiinarient-sections may a similar method ihvoliahg rehydration of the 50 

mg sample in a plastic" mierofiige tube; pelleting; resuspending in 10% formalin for 4 hours 
fixation; washln^pelleting; resuspending in warm 2.5% agar; pelleting; cooling in ice water to 
hafden the agar; removing the ti^ue/agar block froni the M infiltrating and embedding the block 
in paraffin; and cutting up to 50 serial penriainerit sections. 

^ " ' : " fexpreSsed proteins may also b& detected by > flbW Cytometry ais described in 
^Fi^i^ma et al, 1996. In the practice of the metodd, the cells are fixed arid then incubated with a 
mondclonal airitibody against the expressed protein tb d^teRcted; The bound antibodies are then 
25 corifc^ detection. Ai^M^ the fluorescent 

or 

•M^-t- 'V; ---:i- ^^^^ '■>::-■:, 

^ ^ light 

30 c^H Mze arid the right ^gle light ^tter^ cell^htoiir arid nuclecN^ytcfpIasmic ration 
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Since cells are tagged with fluorescent labeled antibody they can then be further characterized by 
fluorescence intensity and positive and negative windows set on the FACS to collect bright 

fluorescence and Ig w fluorescence cells. Cells are sorted at a flow .rate of about 3000 cells per 
second and collected in positiv c and negative cells. 

.5, ' , : . - - .. . . 

• 4. _ ELISA ■ 

As noted, it is contemplated that the encoded proteins or peptides of the disclosure 
have utility as immunogens. e.g.. / in connection with vaccine development, in 
immunohistochemistry and in ELISA assays. One evident utility of the encoded antigens and 
1 0 corresponding antibodies is in immunoassays for the detection of disease state marker proteins, as 
needed in diagnosis and prognostic monitoring. 

Immunoassays, in their most simple and direct sense, are binding assays. Certain preferred 
immunoassays are the various types of enzyme linked immunosorbent assays (ELISAs) and 
radioimmunoassays (RIA) known in the art. Immunohistochemical detection using tissue sections 
15 is also particularly useful. However, it is readily appreciated that detection is not limited to such 
techniques, and Western blotting, dot blotting, FACS analyses, and the like may also be used. 

In one exemplary ELISA, antibodies binding to the encoded proteins of the disclosure aie 
immobilized oiito a selected surface e^biting protein affinity, such as a well in a pqlystyiene 
microtiter plate. Then, a test composition suspected of containing the disease state marker antigen, 
20 such as a clinical sample, is added to the wells. After binding and washing to remove non- 
specifically bound immunecomplexes, the bound antigen may be detected. Detection is generally 
achieved by the addition of a second antibody specific for the; target protein, that is linked to a 
*, : ; det^Qtable label. This type of ELISA is a simple "sandwich ELISA". Detection may also be 
, ja^ the addition of a third antil^ody diat has 

25 J>indi^ affinity for the second anti^>dy, with ^e tli^d antibody beiiig linked to a detecl^jblelabeL ? \. 
In another exeinpl 

antigen are immobilized onto the well surface and then contacted with the antiljodies; of the 
. myejption. -After bia4irtg^^ the 
. r bpmd an^gen is djete^ed. 3^^^ are. linked tp a d^tectabte lal^l, the 

30^::-;- ; iipm^^ dSrectly. Ag^rn, the immunecomplexes may be detected using 
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a second antibody that has binding affinity for the first antibody, with the second antibody being 
linked to a detectable label. 

Another ELISA in which the proteins or peptides aire immobilized- involves the Vise of 
antibody competition in the detection. In this ELISA, labeled antibbdies are added to the wells^ 
5 allowed to bind to ihe disease stWte marker protein, and detected by means of their label. The 
iamouiit of marker antigen in ah unknownsample i£ then determined by mixing the samj>fe With the 
labeled an ti h* kI icn be fore or during incubation with coated wells. The presence of marker antigen 
in the sample acts to reduce the amount of antibody available for binding to the well arid thus 
^edifices the ultimate signal. This is appropriate for detecting antibodies in an unknown sample, 
1 6 where the Unlabeled antibodies bind to the aiitigeri-coated v/el\s attd reduces the aiftoxmt of antigen 
available lb bind the labeled antibodies. 

irr^pc^ctive of the such as 

coating, incubating or binding, w^hing tb remove' bound species, and detecting 
the bound immunecomplexes. These are described ; 
15 In boating a plate with either antigen or antibody, it is typical to indiibate the Wells of the 

pl^e with a soiutioh of the afi or antibody, either oybriiight or for a specified period of hours. 
The wells of the plate iire then washed to remove incompletely adsorbed material; Any remaining 
available suiPfabes of the 1 wells are then "coated with a nou^pedific protein that is antigenically 
' neutral With re^rd id tfre te^ and 
20 solutions of milk powder. The coating all^ thb 
r iihmobilizing surface and thus reduces the background e&usecl by nonsf^iiic binding of antisera 
* %nto&fc^ " - ■ :V ' •■'"u^/> > Ar f - /:u , , ,* 

di^ p^eatire. Thus, after binding of '«nM^^ : '!ti^ 'A»"^«1H^^atittg ^vitlk 4i non- 

25 reactive material to reduce fek^^ 

immobilizing; surface is contacted with the control and/or clinical or biological sample to be tested 
under conditions effective to allow immunefe^^ Detection of the 

^ ^ labeled §&*dhdary biiidihfe ligMor antibody, or a secondary 

id ^rtjtihc^rf with a labeled tertiary aftWbbdy 6f third biriding ligand. 
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"Under conditions effective to allow immunecomplex (antigen/antibody) formation" means 
that the conditions preferably include diluting the antigens and antibodies with solutions such as 
BSA, bovine gamma globulin (BGG) and phosphate buffered saline (PBS)/Tween. These added 
agents also tend to assist in the reduction of nonspecific background. 
5 The "suitable" conditions also mean that the incubation is at a temperature and for a period 

of time sufficient to fallow effective binding. Incubation steps are typically from about 1 to 2 to 
4 hours, at temperatures preferably on the order of 25° to 27°C, or may be overnight at about 4°C 
or so. y 

Following all incubation steps in an ELISA, the contacted surface is washed so as to 
10 remove non-complexed material. A preferred washing procedure includes washing with a solution 
such as PBS/Tween, or borate buffer. Following the formation of specific immunecomplexes 
between the test sample and the originally bound material, and subsequent washing, the occurrence 
of even minute amounts of immunecomplexesmay be determined. 

To provide a detecting means, the second or third antibody has an associated label to allow 
15 detection. Preferably, this is an enzyme that generates color development upon incubating with an 
appropriate chromogenic substrate. Thus, for example, one may contact and incubate the first or 
second immunec^niplex with a urease, glucose oxidase, alkaline phosphatase or hydrogen 
peroxida^^ period of tune and under conditions that favor the 

development of further immuneconiplex formation (e.g., incubation for 2 hours at room 
20 tempera^ 

After incubation with the labeled antibody, and subsequent to washiqgto remove, unbound 
material, the amount x>f label is quantified, e.g., by incubation with a chromogenic substrate such as 
^ijire^^o^l pufle or 2,2 , -^do^r(3^thyl> r benzthiazolm [ABTS] and 

Yl-fizy Vb the case of peroxidase as the; enzyme labeL Quantitation is then achieved Jby measuring 
25 the degree pf color generation, e:g., using a spectrophotometer. 

.. :-rSii $J$& of Antibodies/or Radioimagirtg ... n . ; 

the encoded mqrkprproteins. The ?mfibqdy* for example, may be labeled by^ anyone of a variety of 
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methods arid used to visualize the lotaliied concentration Of the cells producing the ericoded 
protein. 

A radionuclide may be bound to an antibody either directly or indirectly by using an/ 
intermediary functional group. Intermediary functional groups which are often used to bind 
5 radioisotopes which e&ist as metallic ions to antibody are diethylenetriaminepentaacetic acid 
(DTP A) and ethylene diamirietetriacetic acid (EDTA). Examples of metallic ions suitablie foir use in 
this di^osure^e ^^^^^^C^Ga, *Cfc ^ "Zr, and^TL 

In accordance with' this disclosure, the monoclonal antibody or fragment thereof may be 
labelfed by any of several techniques known to the ait. The methods of the present disclosure may 
1 0 alsoiise paramagnetic isotopes fbr purposes of in wvo detection. Elements particularly useful in 
" include ,57 Gd, 55 ftfii, '^Dy, ^Cr, arid 56 Fe. 

Administration of the labeled antibody may fee local Or systemic and accomplished 
intravenously, intraarterial, via the spinal fluid or the like. Administration may also be 
intradermal or intracavitary, depending upon the iro^ After a sufficient 

15 time has lapsed for the monoclonal antibody or fragment thereof to bind with the diseased tissue, 
for example 30 miriutes to 48 hours, the a*ea of the subject Under investigation is exiamined by 
routine imaging teclMicjiies such as MRI, SPECTT, plaftar scintillation imaging and emerging 
imaging techniques, as wen. The exact protocol necessarily varies depending upon factors specific 
to the patient, is rioted aibove, arid depending upon'- the body site under examination, method of 
20 administration arid type of label used life determination of specific procedures is routine in the art 
ITie distribuSori of the bound 5 iiadioactive isotope and its increase or decrease with tone is then 
monitored and recorded. By comparing tlife ' ^stilts vioth datai obtained from Studies of clinically 
in&mal iridividuais,lhie r j^t^enc^ ^d extent of the diseased tissue niay be detenhined. 
r ^ The iri^^ felosure add state cells by their effect On gerie 

25 expression in immune system lymphocytes. In early stages of the disease state* socft immune 
rm - r - " re^riserii^l^ loc^ized: For triple, the response mdy be limits to lymph nddfes immediately 
m sun^diiig a%^ksMkg turiibr or other locdl^ ibife 6f a disease itkte^ t0C^2^i0n of 
differentially expressi^iM& itate markers iri# bdr W iMIi# ^ ^^i»g disuse states of 
^ %^ a detection 
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means is therefore of significance in the management and care of patients with the disease statp. It 
will be recognized that this utility is included within the scope of the present disclosure. 

6. Kits . 
5 In still further embodiments, the present disclosure concerns immunodetectionkits 

. for use with ihc immunodetection methods described above. As; the encoded proteins or peptides 
may be employed lo detect antibodies and the corresponding antibodies may be employed to detect 
encoded proteins or peptides, either or both of such components may be provided in the kit. The 
immunodetection kits thus comprise, in suitable container means, an encoded protein or peptide, or 
10 a first an ti body that bi nds to an encoded protein or peptide, ^nd an immunodeteqtionreagent 

In certain embodiments, the encoded protein or peptide, or the first antibody that binds to 
the encoded protein or peptide, may be bound to a solid support, such as a column matrix or well of s 
a microliter plate. 

The immunodetection reagents of the kit may take any one of a variety of forms, including 
15 those detectable labels that are associated with or linked to the given antibody or antigen, and 
detectable labels that are associated with or attached to a secondary binding ligand. Exemplary 
secondary ligands are those secondary antibodies-that have binding affinity for the first antibody or 
antigen, and secondary antibodies that have binding affinity for a human antibody. 

■ Further suitable immunodetection reagents for; use. in the present kits include the two- 
20 component reagent that comprises a secondary antibody that has binding affinity for the first 
antibody or antigen, along with a third antibody that has binding affinity for the second antibody, 
;^ . 

Thevkits may further comprise a suitably aliquoted cpmpositionpf the encoded protein or. 
polypeptide antigen, whether labeled or unlabeled^ as may be used to p^pare; a standard curve for a 
25 : detection ; . ...... ^ . ,.. _ . 

kits may contain antibody-label conjugates ei^er in/fully conjugated form, in the form 
\ -x ^ conjugated by. the y$er of Ubi^^iit. The .components of 

the kits inay be papJkaged either in aqueous media or ii> Jyq^lul^dfonn. ; - : , ; ; : ie ■ 
• i: The coptainer rneans of the late genenJly includes at least ,one yial^ test tubp, flask, bottle, 
30 syringe or other container means, into which the antibody or antigen may be placed, and preferably, / 




/ 
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suitably aliquoted. Where a second or third binding ligand or additional component is provided, 
; the kit also generally contains a second, third or other additional container into which this ligand or 

Corhporieht may be placed. The kits of the present disclosure also typically include a means for 
containing the antibody, Antigen, and any other reagent containers in close confinement for 
5 commercial sale. Such containers may include injection oir blow-molded plastic containers into 
which the desired vials are retained, 

E. Detection and Quantitation of RNASpecies 

6ne embodiment of the instant disclosure comprises a method for identification of a disease 

10" State in a biological sample by amplifying and detecting nucleic acids corresponding to disease 
state markers. The bidlogical sample may be any tissue or fluid in which lymphocyte cellsmight 
be present. Various embodiments include bone marrow aspirate, bone maitow biopsy, lymph node 
aspirate, lymph node biopsy, spleen tissue, fine needle aspirate, skin biopsy or orgaii tissue biopsy. 
Other embodiments include samples of body fluid such as peripheral blood, lymph fluid, ascites, 

15 serous fluid, pleural eflfbsion, isjniftim^ cerebrospinal fluid, lacrimal fluid, stool or urine. 

Nucleic acid used as a template for amplification is isolated from cells contained in the 
biological sample, according to conventional methodologies. (Satribtook et. til, 1 989) The nucleic 
acid may be genomic DNA or fractionated or whole cell RNA \VTiere RNA is used, it may be 
desired to convert the RNA to a complementary cDNA. In one embodiments the RhTA is whole cell 

20 1^ 

Pairs of primers that selectively hybridize to nucleic a^fcids corf e^oiiding to disease state- 
specific markers are contacted with the isolated nucleic acid trndjer conditibhs that permit selective 
hybridization. Once hybridized, thfe nucleic acidtprimef conipiex is contacted with one or more 
"' enzymes that facilitate template^de^erideni nucleic acid synthesis Multiple totitids of 
25 ampfificdtion,aIso referred to as "cycles," are conducted Until a sufficieiit aniount of amplification 

Next, the amplification product is detected. In certain a^icafi^ be 
• ? jpe^orihed by visual means. Alternatively * the detection may invol ve4ttdirecVid of the 

j^durt 
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label or even via a system using electrical pr thermal impulse signals (Affymax technology; Bellus, 
1994). 

Following detection, one may compare the results seen in a given patient with statistically 
significantreference groups of normal individuals and patients with the, disease state. In this way, it 
5 is possible to correlate the amount of marker detected with various clinical states. 

L Primers 

The term primer, as defined herein, is meant to encompass any nucleic acid that is 
capable of priming the synthesis of a nascent nucleic acid in a template-dependent process. 
10 Typically, primers are oligonucleotides from ten to twenty base in length, but longer 

sequences may be employed. Primers may be provided in double-stranded or single-stranded form, 
although the single-stranded form is preferred. 

2. Template Dependent Amplification Methods 
15 A number of template dependent processes are available to amplify the marker 

sequences present in a given template sample. One of the best known amplification methods is the 
polymerase chain reaction (jeferred to as PCR) which is described in detail in U.S. Patent Nos. 
4,683,1 95, 4,683,202 ^ 4,800,159, and in Innis er fli , 1 990, each of which is incoiporated herein 
by reference in its entirety. 

20 Briefly, in PCR, two primer sequences are prepared which are complementary to regions on 

opposite compleipentary strands of the marker sequence. An excess of deoxynucleoside 
|riphogrfmtes is added to a reaction mixtore along with a DN A polymerase, e.g. , j>qlymerase. 
If the marker sequence is present in a sample, the primers bind tp the marker and the polymerase 
causes the primers to be extended along the marker sequence by adding on nucleotides. By raising 
25 and lowering the temperature of the reactipn mixture, the extended primers dis^iate ftom the 
maricer to fonn reaction products, excess primers bind to the marker and to the jce^ctipfl products 
&jC^ is rq^ted, . ,.v v \ X ■ ■ ■ . ' ■ 

A reverse ^ 

amount of irdRNA ^ Methods of revea^ trans^ into cE>NA are well known 

30 and described in Sambrook et aL 9 1989. Alternative methods for reverse transcription utilize 
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thermostable DNA polymerases. These methbds are described in WO 90/07641 filed December 
21, 1 990. Polymerase chain reaction methodologies are well known in the art 

Alternatively, RNA species can be quarititated by means that do not necessarily require 
amplification by PCR. These means may include other amplification techniques, for example, 
isoth^imic amplification tec Such as the one developed t>y Gen-Prdbfe (San Diego, CA), and 

tfie ligase chain reaction ("LCR"), disclosed in EPA No. 320 308* incorporated herein by reference 
in its; entirely. In LCR, two complementary probe jpairs are ^ in the presence of the 

fargbt Sequence; each pair binds to opposite complementary the target such, that they 

ibiit. In the presence of a ligase, the two prdbe pairs link to form a single unit. By temperature 
xiyciirig, as in PCR, bound ligated units dissociate frdih the target and then serve as "target 
HSt^enc^" for ligation of Access prdWpaii^^ U.S: Patent 4^883^750 describes a method similar to 
LCR for binding probe pairs to a target sequence. 

^beta R^plicase, described! be used as 

still another amplificatiori method in the preset In this method, a replicktive sequence 

of RNk which h^ a regioh compfe^ target is added to a sample in the presence 

of ah RNA polymerase. The polymerase copies the replicative sequence which may then be 
detected.' " * ;: " : ' r: / ; ; • . : " 

Ati iso^ima^^ endonucle^es^ and ligases are 

^ed td achieve : the in^iificatidn of target molecules that 

' ^^h^^prihat^ in one sto^d of ia restrftitioh site m^ ^6 b^^efW iri the atri^H&Moii M nucleic 

r - l ' A ^Si&nd Mi^fame^ cS^r^^bift Isothermal 

-t&; rdfelc ^tisllibn: A sirnilai- iiiethdd; Called &<^f ^ 

^%V^I pixies throughout a regidii targeted for ain^^c^atioi^ followed by a repair reaction in 
which only two of the four bases are present. The other two bases may be added as bi^tinylated 
" &^v^^ A jntrul^^ SBA/ Target ^ecific^^^ may 
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in a sample. Upon hybridization^ the reaction is treated with RNase H, and the products of the 
probe identified as distinctive products which are released after digestion. The original template is 
annealed to another cycling probe and the reaction is repeated. 

Other amplification methods are described in GB Application No. 2 202 328, and in PGT 
5 Application No. PCT/TJS89/01 025,each of which is incorporated hereintby reference in its entirety, 
may be used in accordance with the present invention. In the former application, "modified" 
primers are used in a PGR like* template and enryme dependent synthesis. The primers may be 
modified by labeling with a capture moiety (e.g., biotin) and/or a detector nioiety (e.g., enzyme). In 
the latter application, an excess of labeled probes are added to a sample. In the presence of the 
10 target sequence, the probe binds and is cleaved catalytically . After cleavage, the target sequence is ; 
released intact to be bound by expess probe. Cleavage of the labeled probe signals the presence of 
the target sequence. 

•• ■>/. Other nucleic acid amplification procedures include transcription-based amplification 
systems (TAS), including nucleic acid sequence based amplification (NASBA) and 3SR. Kwoh 

15 et aly Proc. Natl AcacL Sci. USA 86:1 173 (1989); Gingeras ei aL y PQT Application WO 818/1031 5, 
incorporated herein by reference in their entirety. In NASBA, the nucleic acids may be prepared 
for amplification by conventional phenol/chloroform extraction, heat denaturation of a clinical 
sample, treatment with lysis buffer and minispin polumns for isolation af DNA and RNA or 
guanidinium chloride ^ extraption of RNA. -These amplification techniques involve annealing a 

20 primer which has target sg^cific sequences. FolIo\ying polymerization are 
digested with RNase H while double stranded PNA molecule? are heat denalw^d agairi.; In either 
case the single stranded DNA is made fully double strandeji by addition of second t^get specific 
primer^ followed by polymerizatiop. The fouble-^^ multiply 
to In an isothermal c^ are 

25 ; y reverse into double stranded DNA, and transcribed once against with a polymerase such 

as ^ or SP6. The: resulting products, whether ? truncated or complete, indicate target specific 

U > Dayey et a!. , EPA Nd^ 329 822 (m^orporated herein by reference in its entirely) disclose a 
< nticleic acid amplification process involving cyclically synthesizing single-stranded RNA 
30/ < <"ssRNA w ), ssDNA, and double-stranded DNA (dsDNA), whichmay be used in accordance with 
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~ : " the preterit inventioW. The ssRNA is a fits* template for a first primer oligonucleotide, Which is 
elongated by reverse transcriptase (fcN A-dependent DNA polymerase). The RNA is then removed 
frroiri the resulting DNA RN A duplex by the action of HbonucldaseH (RNase H, an RNase specific 
for RNA m duplex with cither DMA or RNA). The restftant s^DNA is a; second template for a ■ 
5 7 second primer; which a! stS Vnclude^ the sequences of an RNA fkilymerase promoter (exemplified by 
T7 RNA polymerase) 5* io its homology to the template. This primer is then extendedby DNA 
polymerase (exemplified b> the large "Klenow" fragment of E. colt DNA polymerase I), resulting 
in a double-stranded DNA CdsDN A") molecule, having a sequence identical to that of the original 
* RNA between the primers and Having additionally, att otie fend, a prbmbter sequence. This promoter 

10 ^eqtieince may fee used by the ^pprdpriatc RNA polymerase to mate many RNA copies of the 
DNA: Th^e Copies may then re-ertter the cycle leai^ng to vefy swift amplification. With proper 
choice of enzymes, this amplification may be done isothfchnally without addition of enzymes at 
each cycle. Because of the cyclical nature of this process, the starting Sequence may be chosen to 
be in the form ^ or RNA. ; ^ v ^ 

15 Miller ettil: 9 PCT Application S9/O6^G0 ^hcoijioratfe herein by reference in its 

entiifctyj discfo^ the hybridization of a 

promoter/primfef sequent 

inkiy RNA copies of %e' sequence This scheme i& iiot cyclic, i k, new templates are not produced 
frdhi the resultant RNA transcripts. Other amplification ihfcthbds include 'Yaee n arid "one-sided 
20^- ; W :J Prolim^ M,Ai ill: iPCR PROTOdOlM: AND 
* m APPU<SmONS 9 Academe Press, N.Y. 0996) fcrid &hs^WM 9 Pr6& Nat'l Acad. Set. USA, 

V ^ Methods based on ligation of two (pr more) oligonucleotides in the pf esenee^Of jliicfeic acid 
having the Sequ&he 6f the resulting n di^lig6huele^tide^ c thereby artipifyiiig the di- 
25 ^igbiiucleoSde, may afcft be used in the ampiifitatiori ^ Of the preseht invention, Vfo et al 9 
^homm 4:560tl 989), mfcoiporated herein by fef^hcS in its entirety, : »■ 
?H h An fex^mpleOf fcfcBffift^^ 
uslfrto ^uktttiiy RNA ill isbme applications is a nuclease protection assay , There ^e many different 
! 1 Versions o£ nuclease protection assays kriOWri to those p&diced>iri the &rL The chaMctferiistic that 
30 all versions of nuclease protection assays share in common is that they involve hybridization of an 
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anfisense nucleic acid with the RN A to be quantified. The resulting hybrid double stranded 
molecule is then digested with a nuclease that digests single stranded nucleic acids more efficiently 
than double stranded molecules. The amount of antisense nucleic acid that survives digestion is a 
measure of the amount of the target RNA species to be quantified. An example of a nuclease 
protection assay that is commercially available is the RNase protection assay manufactured by 
Ambion, Inc. (Austin, TX). 

3. Separation Methods 

Following amplification, it may be desirable to separate the amplification product 
from the template and the excess primer for the purpose of determining whether specific 
amplification has occurred, Iri one embodiment, amplification products are separated by agarose, 
agarose-acrylamide or poly aery lamide gel electrophoresis using conventional methods. See 
Sambroqke/a/., 1989. 

Alternatively, chromatographic techniques may be employed to effect separation. There are 
15 many lands of chromato^ the present invention: adsorption, partition, 

ion-exchange and molecular sieve, HPL>C, and many specialized techniques for using them 
including column* paper, thin-lay er and gas chromatography (Freifelder, 1 982). 

A^ example of a separation methodology is done ; by c^yalentiy labeling the 
oligonucleotide primers used in a PGR reaction with various types of small molecule ligands. In 
20 one such separation, a different ligand is present on each oligonucleotide. A molecule, perhaps 
an antibody or avidin if the ligand is biotin, that specifically Jt>inds to one of the ligands is used to 
coat the surface of a plate such as a 96 well ELISA plate. Upon application of the PCR reactions 
i^i^^^^aa^^^ pf such a prepared plate* the PGR products are bound with sp^ificity to the 
airf^c^ washing the f plate to remove unbound reagents, a solution containing a second 

25 i^lecul^rthat; binds to the first ligand is added. This second molecule is linked to some kind of 
reporter system. The seppnd : molecule only binds to the plate if a PCR prpd[iict Jias been 
©TP^P^ w^reby both oligonucleotide printers are* incorporated into the final PGR products. 
^1^^ detected md quantified m a commercial plate reader 

; i^ch a^ ELISA reaction^ are detected and quantified. An ELISA-Iike system such as the one 



okiciSrvSrv ,iwr» 



WO 98/24935 PCI7US97/22105 

61 

described here has been developed by the Raggio Italgene company under the C-Track trade 
name. 

4. IdentificatiOnMethOds 

5 V : Ahiplificatioh products ni^ 

marker sequences: One typiedl visualization method involves staining of a gel with ethidium 
bromide and visualization under UV light. Alternatively, if the amplification products are 
integrally labeled with radio- ^ the amplification products 

i^ay then be exposed to x-rky film of visualized under the appropriate stimulating spectra, 

10 following separation. 

- In one embodiment, visualization is achieved indirectly. Following separation of 
amplification products, a labeled, nucleic abid probe is brought into contact wito the amplified 
marker sequence! The probe preferably is Conjugated to a- chrornbphore but may be radiolabeled. 
In another embodiment, the probe is conjugated to a binding partner, such as an antibody or biotin, 

15 where the ottief mSehit^r bf the binding pair carries ia detectable moiety. 

In one embddiiiieht,- detection is by Southern blotting and hybridization with a labeled 
probe, the techniques involvdi in Southern blotthigare well known to those of skill in the art and 
may be found in many standard books on molecular protocols. See Saiiibrook et dL y 1989; Briefly, 
amplification products are separated by gel electrophoresis. The gel is then contacted witH a 

20 membrane, such as niftro^ ribn-covalent binding. 

Subsequently, tfle inenibrane is irbubated witka chromophore-coiy^ated|)rohe that is capable of 
y > - %ybn^ ia ti^pt amplification detection is by expose of the membrane to x-ray 

^Itf 

25 reference herein, wiMi discloses ah apparatus and method for the automated electrophoresis and 
4ran§fer of nucleic acids. The ^pabratus permits electrophoresis and blotting without external 
manipulation of the gel and is ideally suited to carrying out methods according to the present 
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. 5. Kit Components 

All the essentia! r^aierials and reagents required for detecting disease state markers in a 
biological sample may be assembled together in a kit This generally comprises preselected^ 
5 primers for specific markers. Also included may be enzymes suitable for amplifying nucleic acids 
including various polymerases (RT, Tag* etc.), deoxynucleotides and buffers to provide the 
. necessary reaction mixture for amplification. 

Such kits generally comprise, in suitable means, distinct containers for each individual 
reagent and enzyme as well as for each marker primer pair. Preferred pairs of primers for 
10 amplifying nucleic acids are selected to amplify the. sequences specified in Genebank Accession 
numbers £87451, T03013, X03558, M28I30, Y00787, SEQ ID NO:l, SEQ ID NO:2, SEQ ID 
NO:3, SEQ IDNO:4, SEQ ID NO:S, or SEQ ID NO:29. 

In another embodiment, such kits comprise hybridization probers specific for disease state 
markers, chosen from a group including nucleic acids corresponding to the sequences specified in 
15 Genebank Accession numbers D87451* T03013, X03558, M28130, Y00787, SEQ IDNOrl, SEQ 
ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:29. Such kits generally 
comprise, in suitable means, distinct containers for each individual reagent and enzyme as well as 
for each marker hybridization probe. ^ 

20 R Useof RNA Fingerprinting to Id entity M 

RNA firigefrprintmg is; a means by which RNAs isolated from many different tissues, cell 
types or treatment groups may be sampled simul^^ relative 
abundances vary. Two forms of this technology were developed simultaneously and reported in 
1992 as RNA fingeiprinting by differential d (Liang ^dPard^ et aL y 1992). 

25 (See also Liang and Pardee, JJ.S; Patent 5^2623 J J * incorporated herein by reference in its entirety .) 
Some of the studies described herein were performed similarly to Donahue et ah, J. Biol Chenu 
. 269^86(^8609^1994. r v/.v -Lv.^ i-.;,- ^:<i, t y;r'x 

All forms of RNA fingerprinting by PCR are theoretically similar but differ in thjeir primer 
design and application. The most striking difference between differential display and other 
30 methods of RNA fingerprinting is that differential display utilizes anchoring primers that hybridize 
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to the poly A tails of miRNAs. As a cbnsequeribe, the PCR product amplified in differential 
display are biased towards the 3* untranslated regions of rriRNAs. 

The basic technique of differential display has been described in detail (Liang and fedee* 
1992). Total cell RNA is primed for first strand reverse tr^scriptiori with an anchoring primer : 
5 'composed of bligo d T. The oligo dT primer is extended using a reverse transcriptase, for example, 
Moloney Murine Leukemia Vims (MMLV) Reverse tmhscrij)tase; The synthesis of the second 
strand is primed with an arbitrarily chosen bligonucleotide, using reduced stringency conditions. 
Orice the dvniblc-stranded cDNA has beeri syhiHesi2»d, attiplificatioh proceeds by conventional 
¥CR techniques- utilizing the same primers. The resulting DNA fihgerpriiit is analyzed by gel 
10 %ledrophl>rciis inui cthidium bromide staining or autoradiography. A side by side comparison of 
^fin^eVpriiiis bbiained from differerit bell derived RN As using the same 1 oligonucleotide primers 
identifies mRN As that are differentially expressed. ^ 1 i? 

RN A fingerprinting technology has been demonstrated as being effective in identifying 
genek that are differentially expressed in cancer cells (Liang a/., 1992; Wohg et al y 1993; Sager 
15 905^ 1995; An et M , 1995); The present 

disclosure utilizes the RNA fingerprinting technique or other techniques described herein to 
identify geties that are differdntially expressed in peripheral blood 6ells in human disease states. 

G. i)tsign and Theoretical Consider 
20 Reverse trarisctiptioh (RT) of RNA to cDNA followed by relative quantitative PGR (RT- 

PCR)lrti^y be used to determine the relative dbheentraitions df specific mRNA iped^M series of 
total cell RNAs isolated from peripheral blood of normal individuals i0 ^ 
%kte. fey ^dernffhing that the concentration of a specific mRNA spe^ie^ varied it isshown that the 
^eete^ mftftA species is differentially Sx^re^d. Tfiis 4^S^4i^»VB^A 

25 • 1 "to borifii^ RNA fingeiprinting are 

• \ •■' V': " ^ ^.^^^^ 

^ ^ ^ J pci^ the* ntimber bf mollies of the amplified target PNA irttafe^se % & factor 

^ ap^albitiiig two Witii ^Veryfcytle of the reacti on itntil soirie reagent becomes lifting* TKSfeafter, 
the rate of amplification becomes increasingly diiriiMshed tifttil thtirfe isKhbt Ihitf^&in the 
30 amplified target between cycles. If one plots a graph on which the cycle number is on the X axis 
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and the log of the concentration of the amplified target DNA is on the Y axis, one observes that a 
curved line of characteristic shape is formed by connecting the plotted points. Beginning with the 
first cycle, the slope of the line is positive and constant. This is said to be the linear portion of the 
curve. After some reagent becomes limiting, the slope of the line begins to decrease and eventually 
5 becomes zero. At this point the concentration of the amplified target DNA becomes asymptotic to 
some fixed value. , This is said to be the plateau portion of the curve. 

The concentration of the target DNA in the linear portion of the PCR is directly 
proportional to the starting concentration of the target before the PCR was begun. By detennining 
the concentration of the PCR products of the target DNA in PCR reactions that have completed the 

10 ; saipe number of cycles and are in their linear ranges, it is possible to determine the relative 
concentrations of the specific target sequence in the original DNA mixture. If the DNA mixtures 
are cDNAs synthesized from RNAs isolated frqm different tissues or cells, the relative abundances 
of the specific mRNA from which the target sequence was derived may be determined for the 
respective tissues or cells. This direct proportionality between the concentration of the PCR 

15 products? and the relative mRNA abundances is only true in the linear range portion of the PCR . 
reaction. 

The final concentration of the target DNA in the plateau portion of the curve is determined 
by the availability of reagents in the reaction mix and is independent of the original concentration 
of target DNAi Therefore, the first, cpndition that must be met before; the relative abundances of a 
20 mRNA species may be determined by RJVPCR for a collection of RNA populations is that the 
• eonqentrations of the amphfed PC^ pro sampled when the PCR reactions ar? in the 




ii-S^S<L " second condition that must be met for an RT-PCR study to ^ucc^ssftdly determine the 

relative abundances of a particular miy*JA ^ concentrations pf the amplifiable 

25 .., ; ^cD^ Sudani. The goal pf an RT-PCR study is to 

determine the abundance of a parfcular ml?NA spwies relative to f the ayera of all 

=»>.■ j ^iidlNA species in the sample, ; Iijt ^^e studies described b^low* mRlSIAs fpr 0-ac^n, asparagine 
> a? ^xtjemai and internal standards to >yluch ,Jlte r relative 
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M6st protocols For competitive PGR utilize internal PGR standards that are appirbriihately 
as abundant as the target. These strategies are effective if the products of the PGR amplifications 
are sampled during their linear phases. If the products are sampled when the reactions are 
approaching the plateau phase, then the less abundant product becomes relatively over represented 
Comparisons of relative abundances made for many different RNA sam£tes/sufcli as is the case 
v&eri examining RNA samples for differential expression, beCdme distorted in such a way as to 
mak£ differences in relative abundances of RNAs appear less than th^y actually are. This is not a 
* sigriifidaht problem if the internal standard is much mof e abundant than the target If the internal 
standard is more abundant than the target, then direct linear comparisons may be made between 
RNA Samples. 

The discussion above describes the theoretical fconsidefations for an RT-PCR assay for 
cliiuc^yde^^^'^ai^ab.' ■ The problems inherent in clinical samples are that they are of variable 
quantity (making notoali±ation pi i 6blematicX arid that they are of variable quality (necessitating the 
C6-^plii5fciWbh'of a reliable internal control, preferably of larger size than the target). Both of 
theise problems are overcome if the RT-PCR is perfbrinedas a relative quantitative RT-PGR with 
an internal standard in which the internal standard is an amplifiable CDNA fragment that is larger 
than the tairget cDNA fragment and in which the^bundance of the mRNA encoding the internal 
standard is roughly 5-1 00 fold higher than the ifcRNA encoding the target. This assay measures 
relativeabii^ 11 

^ Other studifes maiy Be performed using a more coftVentfonM relative quantitative RT-PCR 
^Oti^jx t£ct£fe These as^ys sampled PGR products in the linear jioJtion of 

their amplification curves. The number of PGR cycles that are optimal for sampling must be 
lmpmcal$ detemiinad for each target cDNA frngitoetat. In kdditi^n, the reverse transcriptase 
^dtiCts df feadfi RN A popiilafidn is6Med fr&^ 

hdtm^lized for ^ikl ? cdhcehitotions of dmplifiable cBWAs, WMle empm the 
li^^ ^eparatioi^ aifetedious and 

derived from a relative quantitative R1VPCR with an internal standard itMi. 

£i &h£ ite^oh fbr thisls; that without ^ may 
^?^m^tecT iiiito^a sSftgi^PGR prodnctinthe liti^r^ge ofto^^ 
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v sensitivity of the assay. Another reason is that with only one PCR product, display of the product 
on an electrophoreticgel or some other display method becomes less complex, has less background 

and is easier to interpret. 

5 FL Diagnosis and Prognosis of Human Cancer 

In certain embodiments, die present disclosure enables the diagnosis and prognosis of 
human cancer by screening for marker nucleic .acids. Various markers have been proposed to be 
; correlated with metastasis and malignancy. They may be classified generally as cytologic, protein 
or nucleic acid markers. 

10 Cytologic markers include such things as "nuclear rouridedness" (Diamond et aL 9 1982) and 

• cell plpidy. Protein niarkers include prostate specific antigen (PSA) and CA125. Nucleic acid 
markers have included amplification of Her2/wew, point mutations in the p53 or ray genes, and 
; changes in the sizes of triplet repeat segments of particular chromosomes. 

All of these markers exhibit certain drawbacks, associated with false positives and false 
15 negatives. A false positive result peeing when an individual without malignant cancer exhibits the 
presence of a "cancer marker". For examples elevated serum PSA has been associated with prostate 
carcinprna. However* it also occure in spme individuals with non-malignant, benign hyperplasia of 
: ^ : thppi;o$tatCi .-A false negative result occurs when^M has cancer, but the test fails 

to show the presence pf a ^^ifiq marik^r^ The incidence of false negatives varies for jeach marker, 
20 and frequently also by tissue type K For example, nas point mutations haye been reported to range 
ftoin a high of 95 j^rcent in pancreatic cancer to a low pf zero ..pmG&e^'^\50inc; gyne*x>logic 

'^Cancers;.-, ....... ,^-rii,S-:.. : ; : -.- z'J f x':^>- ■■- , >- - • ;\ - • ~ '" •- — . ' • 

• r Adi^tional problems arise wfaeii a marker is present only within the transformed cell itself. 
v^/HRas^ cell, and 

25 i- : fw (^arnp^ the serum or urine of mdiyi<i^s with reactivated carcinomas. This means that, in ^ 

or its metasfcarffc cells, 

Es^tiaj^^e befor^ the pjrcsence of Ae c^c^r marker 

may be detected. .^\:;%>iKfe } ^M- - ^fu :>'iX V ^ ■ ^v^\ 

^ ; ! ^ but absent in 

30 m and / tnffl^ggmed cells. 
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If one is searching for a marker that is present in normal cells, but occurs at reduced levels in 
transformed cells, the "background" signal from the normal cells ih the sample may mask the 
presence of transformed cells. 

The Meal disease state marker would be one that is present iri individuals with the disease 
5 state, and either missing or expressed at significantly lower levels in normal individuals. The 
present disclosure addresses this need, iri the case of metastatic prostate cancer for example, by 
identifying several new nucleic acid markers that are expressed at higher levels in individuals with 
metastatic prostate cafrcer than in normal individuals* Ih particular, the results for markers UC3Q2 
(SEQ ID #3) and UC325 (SEQ ID #4) are quite promising in that these markers are apparently only 

10 overexpressed in the peripheraU blood of individuals with metastatic tumors and are present at 
relatively low levels iri normal individuals. 

Further, since the iriarkers are present iri the whole blood ot individuals with the disease 
state, the present detection method avoids the problem of having to suspect a tinrior is in place 
before it may be sampled. The instant disclosure has utility as a general screening tool for 

15 asymptomatic individuals, as well as a means of differentially diagnosing those patients whose 
tumors have already metastasized. Depending upon the type of tumbr involved, such individuals 
may be selected for systemic forms of ianti-caneer therapy rather than surgical removal of localized 
tuiribr masses. Certriiri individrials wth advmiced forms of highly ni^ignarit metastatic tiim^ may 
be Optimally treated by pairi management alone. 

20 It is anticipated that in clinical applications j hum&n ti^ud sariiples^ll be sdreeried for the 

prfc^erice 6f the disease state rii^kete identified herein. Such saffiriples worilci ^oiteailly consist of 
j^teiipheial bloody but iriay also consist of needlfe biopsy cores of i^4riph node tissue, ih certain 
embodiments, nucleic acids Would be extracteel frbin thessfe sanities arid ^pllfied as described 
ab6ve. Soirie embodiments would utilize kits containing pre-Selccted prime* pairs or hybridization 

25 probes. Thie ^amplified nucleic acids would be tested for the riiiarkek by, ibr example, gel 
electropfeordas &rid l^idhim bromide staiiiing, or Southern bldttihg, or a solid-fihase detection 
irieans? as dCj^b^ abfeve. These methods; ard well kritf\tti within the iart: : llie levels of Selected 
V-- v ;- markers detected woukl t^com^^i^ 

30 the individual patient would be determined by coi%)arison Wito sri^h groups. - 
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Another embodiment of the present disclosure involves application of 
tp detect a disease state using probes and primers selected from sequences comprising Genebank 
Accession numbers -D87451, T03013, X03558, M2813G, Y0O787, SEQ H) NO:!, SEQIDNO:2, 
SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:29. Similar techniques have been 
5 describedin PCT Patent Application No. WO 94/10343, incorporated herein by reference. 

In this embodiment, the disease state is detected in hematopoietic samples by amplification 
of disease state-specific nucleic acid sequences. Samples taken from blood or lymph nodes are 
treated as described below to purify total cell RNA. The isolated RNA is reverse transcribed using 
a reverse transcriptase and primers selected to bind under high stringency conditions to a nucleic 
10 acid sequence from a group comprising Genebank Accession numbers D87451, T030l3 > X03558, 
M28130, Y00787, SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ II? NO:4, SEQ ID NO:5, or 
SEQ ID NQ:29. Following reverse transcription, the resulting cDNAs are amplified using 
conventional PCR techniques and a thermostable DNA polymerase. 

The presence of amplification products corresponding to disease state-marker nucleic acids 
15 may be detected by several alternative means. In one embodiment, the amplification product may 
be detected by gel electrophoresis and ethidium bromide staining. Alternatively, following the gel 
electrophoresis step the amplification product may be detected by conventional Southern blotting 
techniques, using an hybridization probe selected to bind specifically to a disease state-marker 
nucleic acid sequence. Probe hybridization may in turn be detected by a conventional labeling 
20 means, for example, by incorporation of [ 32 P]-nucIeotides followed by autoradiography. The 
^plification pn^u^ alternatively bp detected using a solid phase detection system such as 

■_ y ,-,. :/ ;,tbgse. utilizing a disease state-marker specific hybridization probe and an appropriate labeling 
means, ore known as C-track™ as described above. . 

l^v:'- The following examples are included < to demonstrate preferred embodiments of the '.. 
25 invention. It should be appreciated by those of skill in the art that the techniques disclosed in the / 
, r ^>^z^^ : ^^(^; follow ^present tech^ to function wgU in the 

practice qf the jnvention, and thus may be ponsideredto ^f^pstitute preforod modes |br its practice. 
However^ those of $kill in the art should, in lifjh* of the present disclose, appreciate that many 
: ^ cl^g^ may be made in the particular embodiments which are disclosed and still obtain a like or 
30 simile result without de^ \:^. r 
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L Materials and Methods - 

A Application oj ~RN r A fingerprinting to discover biomarkers for disease s^^^ 

V - ..." 

RNA fingerprinting (according to Liang arid Pardee, 1992; Welsh et at; 1992;: 
5 Liang and Pardee, 1993) was applied to nucleic acids isolated from the peripheral blood of . 
individuals with metastatic prostate cancer, compared with normal individuals. 

Blood was dra\vri iribm cancer patients arid nonrial individuals into Vacutainer CPT tubes 
^■""SvitH'ficbl gradients (Becton Dickinson and Company/ Frankih Lahe^ The tubes were 
^ ceritrifuged to separate the red blood cells from various tyj>es of nucleated cells^ collectively 
10 ' rfeferi^ to as the bufify eba^ and from blood p&Siiik Tota^ cell RNA 4 was isolatedUfrom the buffy 
V boats by the RNA STAT-60 inethtxl (Tel-Test, Inc., Fri^hdswo^; TX). After RNA isolation, the 
:y y nucleic ^ and 
redissolved in water. The redissdlVed nucleid^ I 
(Boehringer Mannheim, Inc:) following the mahufactiifdr's instructions, followed by organic 
1 5 extraction with phenol rchloiftfonn nsbato^ ethariol. 
The DNase 1 treated RNA was t^^ 
The purity arid cohc0ntration of the RNA in solution was Estimated by determining optical- density 
at'v&ve lengths of 260 hm and 280 niti (Saftibrdok ^/ 1989). The RNA was then examined by 
electfophoreisison The 
20 RNA Was then divided into thr6e iiliquots. One aliquot waS set aside for relative quantitative RT- K 
PGR confirfcati6h using the external standard inethod ^esmbedbelo i \ 
v * A second aliquot was tilsed to fihgerpriiit the RNA by converting the RNA to first strand 
cDNA Using i^daM^ex^^ cDNA by PGR using 

- i^itftriiy chosen Wig^nu<aiei>tid^ (1 0 hucte&ides: rin^leripi);?^i^daying tM resulting PCR 
25 amplified products on an agarose gel st^ed;M^ 

*8s appearing b^dsont^f the geL Hife ex6i gedfcands were then cloned andsequencMii . 

ihdM with metastatic prostate c^n<5er. The 

^ I>o^s fMpipiftted tising the sequential pairwise-ta^ of wbitrarify primed PCR 

30- fingerprinting of RNA (McClelland et aU 1994, Nucleic Acids **#jptffl^ 
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incorporated herein by reference) with several changes. For example, arbitrary oligonucleotides of 
15 to 24 nucleotides were used with Taq polymerase, and one tenth of each first strand cDNA 
reaction in each arbitrarily primed PGR reaction. One hundred and 200 ng were used in each first 
strand cDNA synthesis, respectively. Certain genes disclosed herein werie discovered by the 
5 sequential pairwise method. . 

2^ Methods Utilized in the RNA Fingerprinting Technique 

The second type of RNA fingerprinting studies performed more closely resembled 
\ • ■:• • • 

) - . the protocol of Welsh et aL (1 992), This approach used a variation of theabove as modified by the 

10 use of agarose gels and non-isotopic detection of bands by ethidium; bromide staining (An et aL, 

1995). Total RN As were isolated from peripheral blood samples as described ((^omczynski & 

Sacchi, 1987), Ten micrograms of total cellular RNAs were treated with 5 units of RNAse-free 

DNAse I (GIBCO/BRL) in 20 mM Tris-HCl (pH 8.4), 50 mM KC1, 2 mM MgCl 2 , and 20 units of 

RNAse inhibitor (Boehringer Mannheim), After extraction with phenol/chloroform and ethanol 

15 preeipitatibn,the RNAs were redissolvedipPEPC-tr^ated water. 

Two jig of each total cell RNA sample was reverse transcribed into cDNA using randomly 

selected hexamer primers and MMLV reverse transcriptase (GIBCO/BRL). £CR was performed 

using one or two arbitrarily chosen oligonucleotideprimeir^ (1 0~12mers). PCR conditions were: 10 

mM vTris-HCl (pH 8.3), 50 inM KC1, 1 .5 mM 1 unit 

20 of T^aq DNA polymerase (GIBCO/BRL) in a final volume of 2Q nil. The amplification parameters 

included 35 cycles of reaction wit^ and 60 sec 

e^etosion at 72?C. A final extens^oiivat 32^^ PGR 

^todupts were resolved into a fingerprint by size separation by electrophorpsis through 2% agarose 

jgels in TBE buffer (Sambrook et a/., 1989)^ , The fingerprints; were visualized by staining with 

25 : ^ . : . £0 

Differentially appearing PCRp^ 

J ^ werctexcised from the gel with 4 ra^r JN^% purified from the agarose using the Geneclean kit 

•*d : <(Si<£T ^ i^ing the TA t cloning 

1 V>; ' s strategy (Invitrogen, Ihc^ and Promega, Inc^ Thtase prcnluc^ were not re^ 
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5. Cor^rniation of Differential Expression by ^Relative QUantiiat^ 
Protocols for RT-PCR 

d; keverse transcription 
5 Grie to fiV6 jig of total cell 

ihto cDNA. Reverse transcription was pfeifotmed with 400 units ''6t^i^/El^;'i^^x transcriptase 
; (GIlBCO/BRJL)in the presence of 50 ifctil Tri^HGl (pH S3), 7$ ti^KCl, 3 mM MgCl^, 10 mM 

DTT, 500 mM dNTP, 50 ng ^dom h^i^ers per m 1 U/itil RNase 

inhibitor. The reaction volume was 60 ml. The reaction mixture was intubated at room 
10 temperature for 10 minutes, then at 37°C for 50 minutes. After reverse inscription the enzyme 

was denatured by heating to 65°C for 1 0 minutes. After heat denaturation the sariiples were diluted 

witli water to a final voluriie of 300 ml. 

&T-PCR was utilized to examine mFiNAs for differential expression; The sequences of 

oligonucleotides used as primers to direct the amplification of the various cDN A fragments are 
15 presented in Table 3. 

b Relaihte&anttiati^ 

* The concentrations of thfe original total cell MNAs were determined by me^suiement 
of OD 26 o^8o (Sanibrook et dt 9 1989) and confirmed by examination of ^'^^^^ on 
20 ethidium bromide stained agarose gels. It is required that all quantitative ^ ^GR reactions b6 
, normalized for equal amounts of amplifiable cDNA after the reverse transcription is completed. 

One solution to this is to ternuh&e rafct^^ into plateau phase. 

% This approach was utilized in some studies becausb it isxjtiick arid efficient. Lipocortin II was used 
as the internal standar^^ir tompe^itbr. These ^CRs were set Up ^ fMl6\vs: 

' ^R^gtiits: 200 ri£M each dNfP, 200 riM each 6ligbriude6tiafepririierj IX PGR bttffex (B^Bringer 
lA^inhcnA including 15 i^ 'tif^d, i %f dil^ 
' poi^merase/1 OOrhl of ration vbluitie. " n ^ * s ^ ^ 
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Cycling parameters: 30 cycles of 94°C for 1 min; 55°C for 1 min; and 72°C for two min. 
Thermocyclers were either the MJ research thermocycler or the Stratagene kobocycler. 

c. Relative Quantitative RT-PCR with an External Standard 
There are three problems with the relative quantitative RT-PCR strategy described 
above. Firsts the internal standard must be roughly 4-10 times more abundant than the target for 
this strategy to normalize the. samples. Second, because most of the PCR products are templated 
from the more abundant internal standard, the assay is less than optimally sensitive. Third, the 
internal standard must be truly unvarying. The result is that while the strategy described above is 
fast, convenient and applicable to samples of varying quality, it lacks sensitivity to modest changes 
in abundances. 

To address these issues, a normalization was performed using the B-actin mRNA as external 
standard. These PCR reactions were performed sufficient cycles to observe the products in the 
linear range of their amplification curves. The intensities of the ethidium bromide stained bands 
were documented and quantified using the Isl 000 imaging analysis system manufactured by the 
Alpha Innotech, Corp. The quantified data was then normalized for variations in the starting 
concentrations of amplifiable cDNA by comparing the quantified data from each study with that 
derived from a similar study which amplified a cDNA fragment copied from the fi-actin mRNA 
(Quantified data that had be;en normalized to beta actin were converted into bar graph 
representations. , ^ 

W ; w 

; • a Specimen Collection . n : 

Blood specimens (8-10 nals) were ^llecte^ by vemp^cture into standard serum 
or serum-separating tubes (Becton-Dickinson), allowed to coagulate for 30 minutes at room, 
temperator^, and then centrifuged at lo\y speed (lOOOx for 10 minutes. Some , sgpepimens 
; commg were' immediately frozen auid shipped oyernight by delivery courier. Others were 
collected, processed, frozen, and shipped on dry ice? by pyprnight mail. Upon airival, all 
specimens were stored at -20°C. Repeated freeze-thaw cycles were avoided. 
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6. Measurement of Free and Total PSA 

Two commercially available assays were utilized to measure PSA concentrations, 
an IMMULITE solid-phase chemiluminescence-based assay to measure free PSA (Diagnostic 
Products Corp.; Los Angeles, CA), and the FDA approved assay from TOSOH (San Diego, CA> 
5, that utilizes an enzyme-conjugated monoclonal antibody and fluorescent substrate to measure . 
t<kal;PSA. However" since two different instruments were utilized to measure the components of 
the f/t PSA ratio, the international reference standards for free and total PSA were utilized to 
calibrate both assays and calculate the "corrected" f/t PSA ratio (Stamey, 1995), 

10 c. f/t PSA Reference Standards and Correction of f/t PSA ratio 

The corrected f/t PSA ratio was determined according to Marley et dh 9 1996. 
Reference standards for free and total PSA assays were purchased from the Stanford University 
Prostate Center and consisted of an equimolar mixture of 90% PSA-a-l-antichymotrypsin and 
10% free-PSA (Stamey, 1995; Chen et al y 1995). All testing dilutions were performed with 1% 

15 bovine serum albumin (Fraction V; Sigma Ghemicail Co.) in 20 mM phosphate-buffered saline 
(PBS), pH 7.4. Expected concentrations of the reference standards, determined from molar 
extinction Coefficients ^ 

• : Free and total $SA assays were standardized as Tbllbws; Based upon the mean of seven 
lin^is^ 

20 and t6tal PSA mea^ement were 'calculated^ Slope (m) deviatidris Ayere m^nred relative to the 
• liri^plotbas^u tidFthie refe Since 

all curves passed through the origin, the correction 5 factor fbr the free/total PSA ratio was 
cMcuiatbd fr»m the in slbjpfes • Intia^s&y c^ PSA (range 

The 

23 ^ corrfebtiori free Srid tdtaf PSA valufes ^ei^e 1 ,i9%iui 0;S3; rdspectively • For > 

? ? ilie CfOSOH) total PSA assay teafeted equaHy: t© the free ^d iknintf (PSA-ACSP) forms 
oJFPSA. lie (Inimulite) free PSA assay system w^ tinable to detect the bound fraction of PSA 
}? V j 1 y ^ PSA 
30 ' " were unable t& detect PSA coyaleritly linked ttf d-2 ma<xbj$db\&W(PS or occidt PSA). 
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rf Stati stical Methods 

Pyrenees in free and tota? serum PSA data betvveeri BPH . > 

vl.01, SYSTAT roc.,. Evans,™. , L) , was used . .JV^ Tree (CART) analysis (CART 

■« ^ as we,, as the iogistic ^ J ^ ~" " 

,992) ™= — o„ vaK.es of .He in^nden, pan^ ^ ^ d ^ ^ 

STATA™ software package. P»™»«cr. were also determined using lhe 

^ IL-8 Quantitation 

15 « Phase ^"^oTT ^ ^ * ^ '» *> — <*« 

«.4 McKinley Pi. NE; ££££££ HT ^ ^ ^ ^ ' 
-Monrco.orreagen.s^ 

»L-a stock solution (2000WndV t <K ~* n **-»'*»^ solution and 

» ™ pip e ltt(i LTeT: 'a seZ^ J ^ " " ^ *~ 

— <~ ^ was ^szsss^p^. ^ n * 

* f T^ 

reagents and samnles wen. «„, >. u "«a to assay IL-g concentrations. All 

v """P"* first brought lo room temperature The ~.w. . 

25 each well; 100 „1 of assay diluent RD1-8 50 ... f , ^ ""<^ ^ 

bufFerforatotal of six wnch a* '"' * ^ assay well was aspiratedand washed with wash 
* i^n»3b^ 
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and mixed by gentle tapping. Optical density was measured within 30 min of addition of stop 
solution, using a Bio-Tek EL-808 microplate reader (96 well format) at 450 nm. 

/ IL-8 Standard Curve and Coefficient of Variation (CV) 
5 : The IL-8 standard curve consisted of eight concentrations: 1000, 500; 550, 125, 

62.5. 31.2. 15.6. 7.8 pg IL-8/mL The mean of six different measurements of each standard 
dilution was plotted (x-axis) vs. the miean optical density measured <y-axis). Results were 
pfottcd usinp the KC3 software package (Bio-Tek instruments; Winooski, \VT). 
: > CfWIilcicnt of variation (CV): From the eight data points for each concentration of the 
10 standard curve. Coefficient of Variation (CV) = Standard £>eviation/Mean was calculated to be A 
6.9, (3.4. 1 1.1/! 6;1 ;"4.5. 4.4, 1 3.0 and 34.1%, respectively for the standard curve concentrations, 
listed above. Points with a CV of greater than 13% were not titilized for this study. 1 

" K. " EXAMPLES 

15 Exarhple J %: \ 

Re/at iyeOkanii 

A method to evaluate novel genes (EST s) as diagnostic biqmarkers. 

The reverse transcription-polymerase chain reaction (RT-PCR) protocols described in the 
20= ■, , ; ^followiSig examples were developed as a means to determine the relative abundances ; of mRNA 
species that are expressed in various tissues, organs zmd ee^ 

asapptiedjo prostate tissue in US Application Serial No 08/692,787, incorporated in relevant part 
herein by reference* The; protocols med.tomeet this need must be robust, reproducible relatively 
• w ^^qu^titotrvfe^ sensitive, conservative in its use of resources, rapid and hav& a high thrpughpwt rate. 
25 , - Relative quahtetative RT^PCRJhas the techni that* in theory, meet all of these .criteria. 

:5 ;ln pracHc^ there -are six important barriers to implementing an RXtPGJR based that 
^.comjpa^ The protocol described herein addresses 

v ^^^ea^0f^se v six barriers and has peimitted the realizatioiiKO^th^ potential pf RTTrPCR for this 
! & ? tippiu^Mn: Altfiough the present example is <Jrawp,to the identification ancl congestion of 
3Qi v i differential expressi in various physiological states in prostate tissue, the method^^escribed 
: herein maybe applied to an# type of tissue, and particularly to peripheral Wood cellsto provide a/... 
sensitive method of identifying differential expression. 
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The inventors have described the examination of candidate genes by this method that 
were partial cDNA fragments identified by RNA fingerprinting methodologies. This necessitated 
development of a relatively quantitative approach to independently confirm the differential 
expression of the mRNAs from which these partial cDNA fragments were derived. The key 
5 objective of the described screening protocol is the assessment of changes in the relative 
abundances of mRNA. 

The gene discovery program previously described is focused on analysis of human tissue 
and confirmation must be performed on the same biological material. Access to human tissue for 
isolation of RNA is limited. This limitation is especially problematic in Northern blots, the 
10 traditional means to determine, differential gene expression. Northern blots typically consume 
roughly 20 \ig of RNA per examinbd tissue per gene identified. This means that for the average 
size of tissue sample available, only 1-5 Northern blots can be performed before all of the RNA 
from a tissue sample is completely consumed. Clearly Northern blots are seriously limited for 
primary confirmation of discovered genes and consume extremely valuable biological resources 
1 5 required for gene discovery and characterization. 

Because of such limitations on the amount of available tissue, and because of the need for 
high throughput and rapid turnaround of results, a two tiered assay protocol was developed that 
; is technologically grounded on reverse tr^cnptidri (RT) of RNA into cDNA followed by 
amplification cDNA sequences by polymerase chain reaction (PGR). This coupling of 

20 techniques is fi^equently referred to as RJ-PCR. 

Ohe advantage of RT-PCR is that it consumes relatively small quantities of RNA. With 
2^ Examined sample^ the amount of RNA required to pe^rm a single Noi^ern 

r tybt 6^ assays may be performed with up to^ fo 

: - Another advantage is a high throughput, eight independent experiments which examine eight 
25 ^different tiriRNA species' for differential expression may b£ performed simultaneously in a single 
i v PGR machine with 96 wells. A single individual skilled id this technique may the^by examine 
ancf evaluate eight j geiies> $>dt #ay without significant time constraints. By comparison, ;eveii if 
MN^}bf^ Sufficient qiiali^r and quantity were iavailable to do this numfer of Northern blots, a 
Mriai^ and 
30 evaluate eight genes jter week. In addition to the lower throughput rate of Northern bJots^ 
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Northern blots per week would require the consumption 6f about 400|LiCi of 32 P per week. While 
hot dangerous to use in the hands of a skilled individual, is certainly inconvenient to use. RT- 
PGR avoids the use of radioactive materials. 

An additional advantage of RT-PCR over Northern blots as a technological platform for 
5 evaluating the relative expression of mRNA species is that RT-PCR is much less sensitive to 
differences in tjtiality of the RNA being examined. Th6 human tissues described were removed 
from patients for treatment purposes and were only incidentally saved for further studies. Hence 
tfifc RNA, an extremely labile molecule, is expected to be at least partially degraded. Because the 
^Ats separated byisize 6n a gel in the Northern blot assay, partially degraded RNA appears as 
10 a smear ' rather than discrete bands. By contrast, RT-PGR amplifies billy a section or dewniain of 
an RNA molecule, and as long as that portion is intact, the size m degradation state of the entire 
inoletule is irrelevant. As a result, RNAs that are identical except that they vary by degree of 
partial degradation will give much more variable signals in a Northern blot than they will in an 
RT-PCR. When samples are of variable quality, as is often the case in human studies, the relative 
15 sensitivities of the techniques to variation in sample quality is 'ah important consideration. 

In the practice of this method, total cell RNA is first converted into ct>NA using reverse 
transcriptase priiiied with random hexamers. This pr&tpcol results in a cDNA population in 
which ^cli RNA has contributed according to its relative proportion in original total cell RNA. 
If two RNA species differ by ten fold in their original relative aburidantes in the total cell RNA, 
20 then the cDNA derived from these two RNAs vvili also differ by ten fold in their relative 
abundances in the resulting population of cDNA. This is a conservation of relative 
proportionality in the conversion of RNA to cDNA. 

Another cohsiddr^tibn is the relative rates of ^plificatibn of a targeted cDNA by PCR. 
in tfteofy/ the amount of to am^ffied prodiict synthesized by PGR will be eqfcal to M(E C ). 
25 Where^M il tht* mtos bf the^geteS cD^ 

f %timbCT bf cyd^ perfoniied; E is an efficiency of arhplifi6ation factbft This factor is 
%6mpfe£ ^ varii^ between I and 2. TTie irnjk)it^ht c&nsidarati^n in this as^y is that over most 
of a PCk ^plifiqation, E will be nearly constant aJid nearly equal to 2. M PGR re^ctidins ttet are 
idbhtieM in evety ^% exeept the cDNAs beingu&ed as templates are derived ft6fa different total 
30 cell RNAs, then E will have the same value in each reaction. If a cDNA target has an initial mass 
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• of M i in one PCR reaction and a mass of M 2 in another PCR reaction and if E has the same value 
in each reaction v then after C cycles of PCR there will be a mass of M|(E C ) of the amplified 
target in the first reaction and a mass of M 2 (E C ) of the amplified target in the second reaction. 
The ratios of these masses is unaltered by PGR amplification. That is M 1/M2= IMi(E c )]/M 2 (E c >. 
5 Hence, there is a conservation of relative proportionality of amplified products during PCJR. 

Since both reverse transcription and PCR may be performed in such a way as tp conserve 
^ proportionality, ii is possible to compare the relative abundance of an mRNA species in two or 
: more total cell RNA populations by first converting the RNA to cDNA and, then amplifying a 
. fragment of ihc.'cDNA derived from the specific mRNA by PCR. The ratio of the amplified 
10 masses of the targeted cDNA is very close to or identical to the ratios of the mRNAs in the 

original total cell RNA populations. 
I Six major challenges or barriers to be overcome in order to best use RT-PCR to quantify 

the relative abundances of RNA are as follows: 

15 1) Degradation of RNA must be minimized during RNA preparation.* 

2) Genomic DN A must be eliminated. 

3) RNA must be free of contaminants that might interfere with reverse transcription. 

r 4) The efficiency of ? RT is variable. cDNAs, f not RNA, must be nomialized for equal i 
concentotipnsofamplifiablecDN 
20 5) Limited linear range requires multiple sampling points in any amplification curve, 
o:-' 6) Tube to tube variability in PCR ^ 

p • A/ft i$ ithe developments Qf techniques to overcome these barriers and to prpyide a sensitive 
and accurate method of quanfi^ any tissue type, or Cjril type 

25 ; ; such as peripheral blood cells* or physiologicaLstate that is a part of the prefSCTt invention. 

pr .yi : ? « > The fii^t three barriers to successful RT-PCR are all related to the quality, of v th^ RNA 

:this section address the fir^ tw^ as 
^ :^^spribed nx the last section. These arc ^ de^datipft^of: must be 

fy;;] minimized during RNA preparation and that genomic DNAmust be . eliminated fjnom toe FN A. 
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Two preferred methods for RNA isolation are the guanidinium thiocyanate ixifethod, 
which is well known in the art, and kits for RNA isolation manufactured by Qiagen, Inc. 
(Chatworth* C A), with the kits being the most preferred for convenience. Four protocols are 
performed on the RNA isolated by either method (br any method) befbfe the RNA is be used in 
RT-PCR. 

The first of these four protocols is digestion of the RNAs with Dnasel to remove all 
genomic DNA that was co-isolated with the total cell RNA. Prior td DNasel digestion, the RNA 
is in a particulate suspension in 70% ethariol. Approximately 50 jig 6f RNA (as determined by 
OI*26oy28o) * S rem ^ ve ** OIil *h e suspension and precipitated. This RNA is resuspended in DEPC 
treated sterile water, to this is added 10X DNasel buffer (200 mM tris-HCI; pH 8.4, 20 mM 
MgCi 2 , 50t) mM KC1), 1 0 unite of RNiise Inhibitor (GIBCO^I^;C^i55l 8-012) and 20 units 
of DNasel (GIBCO-BftL # 18068-015). the volume is adj^ted to 50 |il with additional DEPC 
treated water. The reaction is incubated at 37°C for 30 ihinutes. After E>NaseI digestion the 
RNAs are organic solvent-extracted with phenol and chloroforni followed by ethanol 
precipitation, this reprteerits the secdhd ethanol preeij>itation of the isolated RNA. Empirical 
observations -suggest that this repeated prebijpitatioh improves RNA performance in the RT 
reaction to follow. 

F6ilov^g DN^ aii aliquot of the RNA su^petisioii hi lethariol is temovfed and 

divided into thirds. A different procedure is performed on ea6h biie of the di^udtthirds: these 
three procedures are: (1). Aii OD 2 60/2S0 * s Obtaihed usiiig a standard jif 6tbcol aiid is used to 
estimate the amouht of RNA present and its likely qti^ity; ^ 

agarose gel, and the RNA is stained with ethidium bromide. Observation that both the 28S and 
18S RNAs are visible as discreet bands and that there is little staining abbve the point af which 
the 28S rRNA migrates indicate that the RNA is relati\^u^^ 

performance that the examined RNAs be completely free btpartiE d^^^(m 9 Ai&ifaifitisfot to 
determine that the RNA is not so degraded as to significantly eSfect the appearance of the 28S 
rRNA. (3). The total cell KfrAs 4re run using a ^CR^based telttt fc&ifitiife tSia* the 4 DNasel 
treatment actually digested the contaminating genomic DNA to completion. It is Very nteffbrtant 
to confirm complete digestion of genomic DNA because genomic DNA may act as a tfettiplate in 
PCR reactions resulting in false positive signals in the relative quantitative Rt^GR assay 
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described below. The assay for contaminating genomic DNA utilizes gene specific 
oligonucleotides that flank a 145 nucleotide long intron (intron #3) in the gene encoding Prostate 
Specific Antigen (PSA).This is a single copy gene with no jpseudogenes. It is a member of the 
kallikrein gene family of serine proteases, but the oligonucleotides used in this assay are specific 
5 to PSA. The sequences of these oligonucleotides are: 

5'CGCCTCAGGCTGGGGCAGCATT 3% SEQ ID NO:6 
and 

S'ACAGTGGAAGAGTCTCATTCGAGAT 3% SEQ ID NO:7. 

In the assay for contaminating genomic DNA, 500 ng to 1.0 |ig of each of the DNasel 

10 treated RNAs are used as templates in a standard PCR (35-40 cycles under conditions described 
below) in which the oligonucleotides described above are used as primers. Human genomic DNA 
is used as the appropriate positive control. This DNA may be purchased from a commercial 
vender. A positive signal in this assay is the amplification of a 242 nucleotide genomic DNA 
specific PCR product from the RNA sampje being tested as visualized on an ethidium bromide 

15 stained electrophoretic gel. There should be no evidence of genomic DNA as indicated by this 
assay in the RNAs us^d in the RT-PCR. assay described below. Evidence of contaminating 
genomic DNA results in re-digestion of the RNA with DNasel and reevaluation of the DNase 
treated RNA by determining its OD260/28O ratio, examination on electrophoretic gel and re-testing 
. for genomic DNA contamination using the described PGR assay. 

20 f ■ The standard conditions vised for PCR (as mentioned in the last paragraph) are: 

IX GipCO-BRL PCR reaction buffer [20 mM Tris-Cl (pH 8.4), 50 mM KC1] , , ^ 
l^mMlyigCl 2 ' \ ' \ : ■ 

ZOO jiM each of th^ four dNTPs , ; v 

200 pM each oligonucleotide primer . 

25 r concentration of template as appropriate . 

y 2.5 um^ , ^ , 

5f£^ for 45 SeC ; - ^y;., \, ■ - -,- ./ ^./v^ ■ 

30 72 P C for 1:00 minute. ; v, 
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The protocols described in the above section permit isolation of total cellular RNA that 
overcomes two of the six barriers to successful RT-PCR, Le. the RNA is acceptably intact and is 
fr£e from contaminating genomic E)NA. 

Reverse transcriptase^, also called RNA dependent DNA polymerases, as applied in 

5 currently used ''-'iu^d^i^ar^liiolb^y ' protocols, are known to be less prbcessive than other 
coitimdiily used nucldc acid jx>lymerases: It has been obseirved that notonly is the efficiency of 
Conversion of RNA to cDNA relatively inefficient, there is also several :f6ld variation in the 
efficient^ of 6DNA synthesis betWeeri reactions that use RNAs as templates that otherwise 
appear indistingixishdbief The sources of this variation are hot well characterized, but empirically, 

10 it has been observed that the efficiencies of some reverse transcription (RT) reactions may be 
improved by repeated brgahic ethanal precipitations. This implies that some of 

thfc ^ In ithis case, the DNasel 

treatnieht described abbVe may be aiding the efficiency of RT by subjecting the RNA to an 
additional cycle of extraction with phenol and chloroform and ethanol precipitation. 

15 - ^ntaitiiriatibh of the template RNA with inhibitors of RT is ah important biarrier to successful 
RT that is partially overcome by careful RNA preparation and repeated organic extractions and 
ethanol pfecipitationsv :i: <^' r -- ^ 

^eVfe^ li^k^tiori it&ciibn^ are the iSuperscri^tl? - Preariiplification 

System for First Strand C©NA Syrithesis kit which is to 

20 1^ Superscripts is a;^ M-MLV reverse, 

i^t^^vify^ j^lishedi^tc^o^ t&e m^t^turer are^used for 

cDN^#^ *a>NA^yHft a 

^ v bligoi^cle^des Wrandcwii ^qiiei^ laj^pligo dT. 

25^ A ' ' ^^^e^d^fidtf of bli]go dT increases the efficienc^of the 
fc^t^pl^^^^ 



& j - 




Even with the best prepared RNA atifflffie me^t 
30 significant variation in the efficiency of RT. This variation would be sufficiently great that 
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cDNA made in different RTs could not be reliably compared. To overcome this possible 
variation, cDNA populations made from different RT reactions may be normalized to contain 
equal concentrations of amplifiabie cDNA synthesized from mRNAs that are known not to vary 
between the physiological states being examined. In the present examples, cDNAs made from 
total cell RNAs are normalized to contain equal concentrations of amplifiabie b-actin cDNA. 

One jil of each diluted RT reaction is subjected to PGR using^oligonucleotides specific to 
P-actin as primers. These primers are designed to cross introns, permitting the differentiation of 
cDNA arid genomic DNA. These fj-actin specific oligonucleotides have the sequences: 
5VCGAGCTGCCTGACGGCCAGGTCATC 3% SEQ ID NQ:8 

and , :;: ' .. ; 

5* GAAGCATTTGCGGTGGACGATGGAG 3 % SEQ ID NQ:9 , r 

PGR is performed under standard conditions as described previously for either 19 or 20 
cycles. The resulting PGR product is 415 nucleotides in length. The product is examined by PCR 
using agarose gel electrophoresis followed by staining with ethidium bromide. The amplified 
cDNA fragment is then visualized by irradiation with ultra violet light using a transilluminator. 
A white light image of the illuminated gel is captured by an IS-1 000 Digital Imaging System 
manufactured by Alpha Innotech Corporation. The captured image is- analyzed using either 
version 2.0 or 2.0 1 of the software package supplied by the manufacturer to determine the 
relative amounts of amplified p-actin cDNA in each RT reaction. 

To normalize the various cDNAs, water is added to the most concentrated;; eDNAs as 
determined by the assay described in the last paragr^pjht PGR using 1 pl ot the newly rediluted 
and adjusted • cDN^^ of 
cycles of PCR must be increased to 21 or 22 cycles in order to compensate for the decreased 
concentrations of the newly diluted cDNAs. With thi^ empirical ;m^^ be 
adjusted by dihation to contain roughly equal concentrations of amplifiabie cI5^^ Sometimes 
v J^rfm^a^.^ni^. tofe -xeprnted -tol acceptable final normalization; By diyi^g th^ average 
^^^^ ^ 

created that will permit mote accurate compmsons of the relative abundance of RNAs 
exainmedinth^ ^v----- ..y.i: ;j>/ - p ; 
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i 

Once the normalization statistics are derived; PGl^ may be perfcriiieid using different gene 
specific oligonucleotides as primers to determine the relative abundances of other mRN As as 
represented as cDNAs in the normalized p^el of diluted RT redaction prodtibfe. The relative 
intensities of the bands is then adjusted and normalized id p-actih expf essibri by multiplying the : 
5 "intensity quantities by the normalization statistics derived. 
' In the next section an &T-PCR 

likely to capture data from PCRs while in the linear portions of their mhplifidation curves. The 
error duised by bbseiving PtRs after the linear portion of PCR is in the direction of 
quantitatively underestimating inRNA abundance differences. To determine quantitative 
10 differences in mRNA expressibn, it is necessary that the data is bolleeted in the linear portion of 
" ■""'^fiie- respective PCR amplification curves. This ; requirement is; met in the assaty described in 
follow 

The last two barriers to RT-PCR are Addressed in sectibiis thkt follow invo the 
use of pooled cDNAs as templates in RT-PCR. In practice, the protocols using pooled templates 
15 are usually performed befbrc above. 

There are two additional barriers to relative ihRNA qu^ that 
frequently compromise interpretations bf te^lts; bbtkiried by this method* fh£ first of these 
A ih\tolvfes the need to ^tiahtffy the /IffipM the P&k is still in the linear 

portion of the process where *'E^ behaves as a constant and is nearly equal to two. In the "linear* 
20 portion of the amplification curve, the log of the mas? of the amplified product :is directly 
' |^ <I#eM bf the" PGR jp^rbce&j **B^ *s riot Constant. Late in 

; uritil th^fe^ itt^ase hilP^p^dkicrf M^ with 

^ ^ ^ at 
h c *vMc% 4fie ofigbniiclebtid^^riiiie4§'^g^ to'the individual product strands. TMs competition 

PCR 

'^--^eis^ the 
30 "plateau" phase of-fe*&^ a» d 
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PGR begins to move towards the? plateau phase, the conservation of relative proportionality of 
amplified .products during PCR is lost. This creates an error in estimating the differences in 
relative abundance of an mRNA species occurring in different total cell RNA populations. This 
error is always in the same direction, in that it causes differences in relative mRNA abundances 
5 to appear less than they actually are. In the extreme case, where all PCRs have entered the, 
plateau phase, this effect will cause differentially expressed mRNAs to appear as if they are not 
differentially expressed at all. 

To control for this type of error, it is important that the PCR products be quantified in the 
linear portion of the amplification curve. This, is technically difficult because currendy used 
10 means of DN A quantitation are only sensitive enough to quantify the PCR products when they 
are approaching concentrations at which the product strands begin to pompete with the primers 
for annealing. This means that the PCR products may only be detected at the very end of the 
linear range of the amplification curve. Predicting in advance at what cycle number the PCR 
products should be quantified is technically difficult. 
15 Practically speaking, it is necessary to sample the PCR products at a variety of cycle 

numbers that are believed to span the optimum detection range in which the products are 
abundant enough to detect, but still in the linear range of the amplification curve. It is impractical 
to dpthis; in a sttidy that involves large numbers of samples because the number of different PCR 
, H ^reactions and/or number of different electrophoretic gels4Aat must l>e run becomes, prohibitively 
20 , large, > ...... A . rtWv -:yi\ ?■.>.•■ 

? To overcome these limitations, a two tiered approach was designed torela^ 
, > ^ ley^lsausing RT-PCR. In ; ^he dBrst - ti^ ,j>^ls oC,cD|^y priced by 

combining equal amounts of normalized cDNA are examined to dtetetj^ 
•HfiH abundances vary in the average individual with a particular phy^j^^^ th^ 
25; very small number such as two to fonr. : |n the studies 

described herem^ indj^dn^Js gnd those 

\^ j^st#e c^cer. Each ^ of indhadtials; 

"vVi While i^s approach does not ^prii^^e 4iffe^^ 

30 it permits many duplicate PCRrreax^ons ^>e sim^ : . _ 
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The individual duplicates rhay harvested and examined at different cycle numbers of 
PGR. In studies described below, four duplicate PCR reactions were set up. One duplicate was 
collected at 31, 34, 37. and 40 PCR cycles. Occasionally, PCR reactions were also collected at 28 
cycles. Examining the PCRs at different cycle numbers yielded the following benefits. It is very 
5 likely that at least one ol iKc RT-PCRs will be in the optimum portion of the amplification curves 
to reliably compare relative rriRlsl A abundances. In addition, this optimum cycle number will be 
known, k> that studies with much larger sample sizes are much more likely to succeed* This is 
the second tier of a two tiered approach that hSa)s been taken to relatively quantify mRNA 
abundance levels using RT-PCR. Doing the RT-PCR with the pooled samples permits much 

10 more efficient applicatioH of RT^ PCR to the samples derived frbm individuals. A further benefit, 
also as discussed below/ tube to tube variability in PGR may be discounted and controlled 
because most sUidies yield multiple data points due to duplication. 

Like the previously described protocol involving individuals, the first step in this protocol 
is to normalize the pooled samples to contain equal amounts of amplifiable cDNA. This is done 

15 using Oligonucleotides that direct the amplification of p-actin. In this example, a PCR 
amplification of a cDNA fragirient derived from the p-actin mRNA from pools of normal 
individuals and individuals wth mfetastatic prostate cancer was performed. This study Was set up 
as four identicai PGR rba^ 
after 22, 25, 28 and 31 cycles^ 

20 ' Mt tiievPGRjs were still in the liiie^ i^gei of &feir ^jitification curves at 22, 25 arid 28 cycles 
b^ th^tfae^ U^lin^tyW ratids o£ the ba«id intensities 

remain constant and imerh^tly'^ but 
■ : " ^Ife^ i^os beS6irie ditf 6rted at 31 cycled This (quanititatSbh wll al^ ^rmit the derivation of 

25 ? ^ 1 ) ; : '\\ ■ oy: ft . ' 

* i -rhig ^tudy is then repeated usMg g^ for & gene othe^^tMte P-actin. The 

v ^ been 
3#- v? ::".;:^i^^gcl in the liri^^ may be demised byv 
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determining if the proportionality of the PCR products has been conserved as PGR cycle number 
has increased. If the ratio between the two pools of a given PCR product remains constant with 
increasing cycle number, this is strong evidence that the PCRs were in the linear portions of their 
amplification curves when these observations were made. (This is better conservation of 
proportionality than is frequently observed. In some studies, data was excepted when the ratios 
were similar but not identical.) This conservation of proportionality was lost at 40 cycles. This 
indicates that these PCRs arp nearing the plateau phases of their amplification curves. 

The final major barrier to quantifying relative mRNA abundances with RT-PCR is tube 
to tube variability in PCR. This may result from many factors^ including unequal heating and 
cooling in the thermocycler, imperfections in the PCR tubes and operator error. To control for 
this source of variation, the Cole-Parmer digital thermocouple Model # 8402-00 was used to 
calibrate the thermocyclers used in these studies. Only slight variations in temperature were 
observed. 

To rigorously demonstrate that PCR tube to tube variability was not a factor in the studies 
described above, 24 duplicate PCRs for p-actin using the same cDNA as template were 
performed. These PGR tubes were spattered over the surface of a 96 well thermocycler, including 
the comers of the block \yhere it might be suspected .the temperature might deviate from other 
areas. Tubes were collected at various cycle numbers. Nine tubes were collected at 21 cycles. 
: Nine tubes vvere collected at 24 cycles, and six tubes were collected at 27 cycles. Quantitation of 
the intensities of the resulting bands with the IS- 1000 system determined that the standard error 
of the mean of the? PCI^ product a^iin<3anpes was ±13%. This is an acceptably small number to be 
discounted as a ma^ - i 

variabilis that might arise from any source. By examining the abundance of the PCJR products at 
several different cycle numbers, it may be determined that the mass of the expected PCR product 
is Ineire?^ increasing PGEt cycte numbw. Npt only does this demonstrate 

that the PCRs ^ being examined in toe Ihi^ phase of theJPC^ is most reliable, 

it demonstrates that each reaction with the same template is consistent with the d^ta from the 
su^ source pf variation, the expectation that 

PGR product mass would increase appropriately ^th increasing cycle number w^^ met 
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This would indicate artifactual variation in results. Internal duplication and consistency of the 
data derived from different cycle numbers controls for system derived variation in tube to tube 
results. 

As described in the preceding paragraphs, the RT-PCR protocol using pooled cDNA 
5 templates overcomes the last two barriers to effective relative quantitative RT-PCR. These 
barriers are the need examine the PCR products while the reactions are in the linear portions of 
their amplification curves and the need to control tube to tube variation in PCR. The described 

• protocol examines PCR products at three to four different cycle numbers. This insures that the 
' ' PCRs are quantitated-m their linear ranges and, as discussed in the last paragraph, controls for 

10 • possible tube to tube variation. 

v One final question is whether P-actm is an appropriate internal standard for mRNA 

quantitation, p-actin has been used by many investigators to normalize mRNA levels. Others 
have argued that p-actin is itself differentially regulated and therefore unsuitable as an internal 
normalization standard. In the protocols described herein differential regulation of p-actin is not 7 

15 a concern. More than fifty genes have been examined for differential expression using these 
protocols. Fewer than half were actually differentially expressed. The other half were regulated 
similarly to p-actin within the standard error of 13%. Either all of these genes are coordinately 
differentially regulated with p-actin, or none of them are differentially regulated. The possibility 

• that all of these genes could be similarly and eoordinately differentially regulated with p-actin 
20 seems highly uhlikely. This possibility has been discounted,. , •/ 

:: $ i ip-actm has also been criticized by some as an internal standard in PQRs because of the 

^ large nurnber of pseudogeries of p-actin ftat occur ins mammalian genomes. This is, not a 
; ^consideration in the described assays because all of the RNAs used herein are demonstrated to be 
free of contaminating genomic DNA by a very sensitive PCR based assay: In addition, the cycle 
25 number of PCR needed to detect p-actin cDNA from the diluted RT reactions, -usually between 
19 and 22 cycles, is sufficiently low to discount any contribution that genomic DNA might make 
to the abundance of amplifiable P-actin templates. . = <•■-..■ 
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Example 2: 

Identification oj r Markers oj 'Metastatic Prostate Cancer by Use of 
RNA fingerprinting by PCR primed with oligonucleotides of arbitrary sequence. 

5 RNA fingerprinting by PCR, primed; with oligonucleotides of arbitrary sequencd was 

performed on RNAs isolated from peripheral human blood. Bands which appeared to be 
differentially expressed were cloned. - 

For this study* total celt RNA was isolated from btiffy coat cells as described above; cDNA 
was made from one to five m» of each isolated RNA. All cDNAs were normalized for similar 
1 0 amounts of B-actin cDNA by RT-PCR. RT-PCR products were electrpphoresed through agarose. 

For relative quantitative RT-PCR with an external standard, quantitation of band intensities 
on ethidium bromide stained gels was performed using the IS- 1000 image analysis system 
manufactured by the Alpha Innotech Corp. A normalizing statistic was generated for each cDNA 
sample, as the average of all B-actin signals divided by the B-actin signal for each cDNA sample 
15 respectively. Data for each sample was then normalized by multiplying the observed densitometry 
observation by the individual normalizing statistics. Normalized values predict differences in the 
steady state abundances of the respective rnRN As in the original total cell RNA samples. 

The nucleotide sequences of all cloned PGR products were determined by dideoxy 

ABI or Phannacia automated sequencers. ^ 
20 lliis^rofocdl resulted in the discovery of an mRN A species that was 2-3 fold less abundant 

in the peripheral blood of metastatic prostate cancer patients than in the peripheral blood of normal 
individuals of both sexe^, Tlie s^uience of this band, referred to as UGBP Band #35 (SEQ ID 
NO: 1)^ matches aii ESl^di^fv^ from a^fetal brain cl>h^ library ^GenBank A<^^ioriiTn)3013). 
regtilatidh of this band ih the peripheral blood of metastatic prostate cancer patients was 
25 " corifinri^ ^ n 

■■ ^y-'v^i^r^ - ?r i-%- y ^p. : \-'^thi^m ' ' . Example 3: . w v ' ' , < W • ^^-A^jf^-i ' . ' ' 

Identification<jfMorker^0^ 
Use of RNA fingerprinting by the Pairwise Sequential Method 

30 

RNA fingerprinting was used to identify differentially expressed RNA species according to 
the pairwise sequential method of McClelland et at (1994), as modified to use larger (17-25 mer) 
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arbitrary oligonucleotides. PCR amplification products were labeled using <x-32P-dCTP and Were 
•visualized by autoradiography after electrophoresis on denaturing polyacrylamide gels. A number 
of bands appeared to be differentially expressed, and were cloned as described above. 

UC Band #32 1 was confirmed by RT-PCR to be do\Vn regulated in thie peripheral blood of 
prostate cancer patients, with a four-fold decrease observed compared with normal individuals. 
The DNA sequence of Band #32 1 does not match any known sequences in the GenBank database. 
It therefore represents a previously undescribed geiie products 

UC Band #302 and UC Band #325 were both observed tc> be up regulated in the peripheral 
blood of metastatic prostate cancer patients. UC Band #302 is identical in sequence to a portion of 
the sequence of elongation factor 1-a (GenBank Accessioh #X03558). This band was modestly 
mbreased between 1 .6 arid 2-fold in metastatic cancer patients compared with normal individuals. 

UC TBahd #325 Was^^fo consist of two different 

encoded by the interleukin-8 (IL-8) gene. UC Band #325-1, the previously identified mRNA 
species of IL-g (Genbank Accession #Y00787), is apprb^rlntateiy seven-fold more abundant in the 
peripheral blood of metastatic prostate cancer j>atfents. The alternatively spliced IL-8 mRNA, 
containing intron #3 of the IL-8 gene (Genbank Acteission #M28130) is up to seven-fold less 
abundant in the peripheral blood of metastatic prostate cancer patients. Fig. 1 A shows relative 
qti^titative RT-PCR of the differential expression of IL-8 (-UC235) in peripheral blood of 
patients wi^ ihet^ pixistate cancer (1^ at different PGR cycles 

(cy). The two alternatively spliced forms of the IL-8 mRNA are observed. The upper band (int.+) 
includes intron 3 in the mature mRNA^ Stife lower band (int.-) lacks intron 3. Fig. IB shows 
relative quantitati^fcR in peripheral blood 

of patients with metastatic prostate cancer in lanes 1-5 and tk pool of normal individuals (N)- The 
alterjtetively spliced foims of the IL-8 mRNA observed are different between normal individuals 
and those with prostate cancer. Overall, there fe an approximately T 30-f6ld change in the ratios of 
thbiiM <^ed forms of IL^ 8 mRNA in individt^s with metastatic prostate cancer coinpared with 
nom^ itidividiials. The^e results have been coiofirmedby relative q^titativeRT^Ri: , 

As described abovfe, an increased expression of IL-8 mRN&fcas^ in 
caolr ^atieiits. However, this represents the first finding of an ^t^^tively ispliced form of IL-8 
mRNA; containing intron 3> that is significantly more abundant in iionnal iiidiyi^d 
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with metastatic prostate cancer patients. Theses results are surprising in view of previous reports 
which had failed to find any alternatively spliced forms of IL-8 mRNA in normal individuals or 

cancer patients. 

It will be recognized that the genes and gene products (RNAs and proteins) for the above 
5 described markers of metastatic prostate cancer are included within the scope of the disclosure 
herein described It will also be recognized that the diagnosis and prognosis of metastatic prostatic 
cancer by detection of the nucleic acid products of these genes are included within the scope of the 
present invention. Serological and other assays to detect these inlWA species or their translation 
products are also indicated. It is obvious that these assays are of utility in diagnosing metastatic 
10 : cancers derived from prostate and other tissues. 

Most signi Hcantly. these Examples denqionstrate the feasibility of ming RNA fingerprinting 
to identify mRNA species that are differentially expressed in the peripheral blood of patients with 
asymptomatic diseases or in patients with symptoms that are insufficient for a definitive 
diagnosis. It will be appreciated that this technique is applicable not only to the detection and 
15 diagnosis of prostate and other cancers, but also to any other disease states which produce 
significant effects on lymphocyte gene expression. Uses which are contemplated within the scope 
of the present disclosure include the detection diagnosis of clinically significant diseases that 
requires medical intervention, including but not limited to asthma^ lupus eiythrqmatosis, 
rheumatoid arthritis, multiple: sclerosis, myasthenia gravis, autoimmune thyroiditis, ALS, interstitial 
20 cystitis and prostatitis; / - , f ; 

" : V.* ) • y • - TABLE 2 7. r ',' " >; ^. r ; " ' >Uy, ■■ 

Gen^ 



Name of 
cPNA Fragment 



Sdquehce 
Deternjined 



Gonfinried 
by RT-PCR 



UCPB35 , 

•U<3 ?21 S|Q iP Na:2 ; : 
WC 325-1 S^ID NQ;4 
Up32Sr2SEQID NQ:S 



^Yes 



Yes 
Yes 



GB#TQ^01| 

, Mo 
PB#Y00787 
1I>8 
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TABLE 3. 

Oligonucleotides used in the relative quantitative RT-P£R portion of these studies. 

Oligonucleotidesused to examine the expression Of genes: 
UCPB Band #35 (previously uncharacterized gene). 
5 5' TdeAAACtTTCAC 
•3 5' CTTGTGACTTGCTTTGATAGAATG3', SEQ ID NO:l 1 

10 UC Band #302 (elongation factor 1-a). , > 

' " 5' GACAACATGCTGGAGGGAAC3TGC3 , , SEQ ID NO: 12 

•- * s 5' AGGACGAATTTTGTAAGAACATCGT3\ SEQ ID NO: 13 

UC Band #321 (previously uncharacterized gene). 

5' TGTCCAGAGATCCAAGTGCAGAAGG3', SEQ ID NO: 14 
15 5' GAGCTCCAGGAGACAOAAGCCATAG3', SEQ ID NO: 15 

UC Band #325-1 (IL-8). 

5' GGGCeCCAAGGAAAACT3\ SEQ ID NO:l 6 
r 5' TGGCAAGCCTACAAGAGACG3'i SEQ ID NO.17 

UC Band #325-2 (IL-8). ' ; " r -r- . f -l ' ,; ' . .' ■' '.- 

20 5' GGGCCCCAAGGAAAACT3', SEQ ID NO:l 8 

5^TGGGAACCeTACAAGAGAeC3', SEQlD NO:l 9 

Cohux>lswsed to hbr^ 

f - >. , 6-actin , ; : > - A v'^V •'••>/<-.■ .- b- « - ..>;.>'">'■ 

25 v S'Ct3AX3GtGC^ " * 

, ^ , ^ ; - $ G*P^CATTT^GGTGGAC^ NO:9h ? 

As^ar^Synthetase(AS) ' ">* ' " * * ^ ' " »" ; 

f> ,^ r £,AC^ > • • " 

30^/:'..., .-, . ' ... i ... ., :• . v % . ', t ; ' , v 
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Example 4: 

DNA Sequences of Markers of Metastatic Prostate Cancer 



The DN A sequences of the markers of metastatic prostate cancer were determined by 
5 Sanger dideoxy sequencing as detailed above. The identified sequences are provided in Table 4. 



TABLE 4. 

DNA Sequences of Markers of Metastatic Prostate Cancer: 



10 UCPB Band #35 (SEQ ID NO:l) Matches a fetal brain EST, GenBank Accession # T03013 

S'GGCAGGGGCTTGTGACrCTAAGATGGCTTCATTCACATGCCTAGGGCCTCAGTAGG 
ATGA CTGGG ATGGCGGTGGAAAAGTGGGAAGTCTTCTeTCT 

GGACTTTTTATATGATTCTGGAAGTATT€GAAGAAGGCAAAAGTAAAAACTGCAAA 
GCGTCTTAAAATAGAAGTTCAGAAGCCACATTATATCACTTCTGTTGCATTCTATCA 
15 AAGCAAGTCACAAGCCCCTGCCAATCA 3' 



UC Band # 321 (SEQ ID NO:2) previously uncharacterized Gene 

20 5'CACACACTCCCCCATTCTGAGCCCCAAGAGGCTGATCCCTAAGGATGTCCAGAGA 
TCCAAGTGCAGAAGGAGAATGTGGTGAGGCTATTTATTCCCCCAGTGCCTTCCCTGC 
TGGGCTATGGATGAAGAGTGGGTGACTTCATCTAGGAAAGAGGTATGGCTTCTGTCT 
CCTGGAGCTCACCA 3* 

25 ■ ■ ■■: ; : " ■ ■ ■ • ■ ,■>•>■: 

UC Band #302 (SEQ ID NO:3) Human Elongation Factor 1 ^alpha, GenBank Accession 
#X03558 « - 

5'GGTGAGCCCCAGGAGACAGAAGAGATATGAGGAAATTGTTAAGGAAGTCAGCAC 

30 TTACATTAAGAAAATTGGCTACAACCCCGAC^ 

TTGGAATGGTGACAACATGCTGGAGCCAAGTGCTAACATGCCTTGGTTCAAGGGAT 
GGAAAGTCACGGGTAAGGATGGCAATGCCAGTGGAACCACGGTGCTTGAGGCTCTG 
GACTGCATCCTACCACCAACrcbfcCAACTGACAAGCCCTTGCGCCTGCCrCTCCAA 
GGATGTTCTTACAAAATTGGTGGTATTGG^ 

35 AACTGGTGTTCCTCCA AACCCCGGTTATGGTGGGTTTCCTCCTCCTTGGA 3 * 

UC Band #325-1 (SEQ ID NO:4) Human IL-8 mRNA, GenBank Accession #Y00787 

5'GGGCGGAACAAGGGAGCGCTAAAAGG 
40 ATAATTCCAAAACCTTTCCAAACCCCAAATTTATTCAAAGGAACTGAGGAGTGGATT^ 
GAGGAGTGGACCAACACTGGCGCCAAACACAGAAATTATTGTAAAGCTTTCTGATG 



wa<5nnntr> <wr> 
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GAAGAGAGCTCrGTCtGGGCCGCAAGGAAAACTGGGTGCAGAGG 
TTTTTGAAGAGGGCTGAGAATrCATAAAAAAATTCAl"TQTCTGTGGTATCCAAGAAT 
CAGTG AAG ATGC C AGTG AAAGTTC AAGCAAATCTAGTTC AACAtflT 
TGGGTCTGTTGTAGGGTTGCCAGTTGTT 3' 

UC Band #325-2 (SEQID NO:5) Human IL-8 mRNA Containing Intron 

5'GCITGOGCCCCAAGGAAA 
1 0 GTAAGTTATATATTT^ 

AAGTCAGCCTATA^TTTCtT^ 
TGGTTAAAAAAAAAAGGAATAGCATCAATAGTC 

AAGACCATACATAGTTTGCCCAGGAAATTCTGGGTTTAAGCTTGTGTCCT 
AGTAAAGTTCTTTQTCACTCCCAGTAGTGTCCTATGTT 
15 TCCCTATTTATAGTTGAGi^fAfAGAGCAfGT 
: • ATTGACTTTTCAAGAAGC 
GAATTTTATTTTAGGGCTG 
AAtCAGTGAAGATC 
GTGTGGGTCTGTTGTAGGGTTGCCA 3' 

20 

Example 5: 

Detection arid Differential Diagnosis of BPHvefsits Localized and 
Advanced Stage Prostate Carcinomas Using 
25 'Goin^^^P^^JSC^^^ Other Prostate Disease Markers. 

A ratal of 164 serum specimens from nortell inen or men with a BiBjJ^y confirmed 
diagnosis of BPH or prostate cancer were studied. These serum specimens were provided by Dr. 
Geo^e Wright from the Virginia Prostate Center at the Eastern Virginia Medi^ Sehpbl and by 
30 Dr. Robert Vessella from the University of Washington or were: normal donors from UroCor, 
.^Inc. M for eithdr fe^^ and 

C only) within six months after PSA serum collection and/or a DRE-positive diagnosis. Ail 
V ^ surgical o^ honnonal ; therapies pie me^n age of the total 

sample w^ ; 6?.4 ± 8-6 years -(range = 37 * 91, years);,old. 
35 v The sub^t of pati^ts> utilize for multivariate 4ia of 13 

• BPH and 64 GaP (Stages A^B* C) cases fft>m ^ All 
v , Tients in the subset had a total PSA b^ for m 
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PSA testing (Marley <?/ al y 1996). Also evaluated were a subset of Stage T> CaP patients, with t- 
PSA values ranging from 6.5 - 867 ng/ml. 



Diagnosis 


N 


Mean Age ± Std. Dev. (Range) 


Normal 


8 


< 50 years 


BPH 


55 


66A± 8.6 (37 - 87) years 


CaP Stage A 


24 


74.7±7.8 (61 - 91) years 


CaP Stage B 


48 


68.3±7.9 (5 1-85) years 


CaP Stage C 


' 14 : ' 


68.9±6.9 (60 - 80) years 


CaP Stage D 


14 


72.3±8.6 (58 - 86) years 



Table 5 shows the distribution of the total PSA levels, the f/t PSA ratios, and the UC325 
levels for the 164 patients, broken down by .normals, BPH, and Stages A> B, C, & D prostate 
cancer. Only the BPH, Stage A, Stage B, and Stage C prostate cancer patients were included in 
the statistical analysis. " 



TABLE 5 

UC325 Patient Sample Characteristics (n = 164) 





-. ' -) ■ ■ 




Mean Value ± Std* Dev. 








UC325 


Total PSA 


mpsA 


Diagnosis 


. N 


(pg/ml) 


(ng/ml) 


Ratio (%) 


Normal 


8 


0.2 ± 0.6 


N/A 


N/A 


BPH 


55 


6.8 ±6.1 


6,9 ±4,0 


21:9 ±10.9% 


CaP Stage A 


24 


19,1 ±10.4 


6^2 ±2.7, 


14.6 ±10.5% 


CaP Stage B 


48 


13.5 ±9.5 


8:8 ±616 


li;9±5.7% 


CaPStagdC 
CaP State D ; 


15 
14 


- 19.1 ±7.9 
78.9 ±197 


16:2*7,6 
244 ±332 


11.2 ±8:3 ; ^ 
12.4 ±7*1% ,,v 



Table 6 flluis*rafes the ability fo Ml«A iatio at three difi&enr ctrtoffs m ^exehtiate 
prostate cancer and BPH in the invfeiitors 1 patient sample. UG325 (IL-8) and t^PSA ^ analyzed 
at single Caassifieation ahd Regression Tree (CART); ciitbff points for tfie ^aine outcome. Note 
ihe signiftciiritMi^^ (IL-8) 
sdftikn ias^ to detect xtoilcklly b^gan confined. The com as a 

continuous viable, and t-PSA or f/t PSA ratio provides a highly predictive multivariate test 
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system to diagnose CaP (clinical stages A & B) without any interference provided by BPH in the 
inventors' patient subset. v f 

TABLE 6 



Ability of Serum Tests to Dis criminate BPH and CaP. 



Serum Test 


Cutoff 


Sensitivity 


Specificity 


V/./-AUC 


p-value 


p ft PSA Ratio 


11% 


52.9% 


91j9% 


0.7905 


< 0.0001 


MM 


14% 


70.1% 


80.0 


MM : 






20% 


85.1 


47.3 


MM ' i 




UC325 


9.8pg/ml 


72.4% 


74.5% 


0^7973 h 


<p,oooi 


Total PS A 


14.8 ng/ml 


17.2% 


98.2% 


0.5995 


0.0134 


f/tPSA&UC325 


, 0.69** 


71.3% 


: 90.9% 


:, 0.8784. 


<o pooi 


Total PSA & UC325 


0.64** 


.62.1% 


... 85.5% 


0.8069 


' <0 000l 



*A11 cutoffs determined using Classification and Regression Tree Analysis (CART) 



* ^Predicated Probability value calculated usictg logistic regression function 

To further substantiate the results of Table 6> indi vidual analysis using Receiver Operator 
Characteristic (ROC) curves are provided for each variable. Figure 2 illi^^te^ Ae ability of t- 
PSA to distinguish BPH and Stages A, B, and C prostate cancer. Figure 3 shows the ability of fit 
PSA ratio to distinguish BPH and Stages Ai B, and C prostate cancer. Figure 4 shoWs disability 
of ipp^Si (IL-8) #nief ^ ^sto^ish^^Pf JStagfs and C prp^tate cancer. Figure 5 

shews the ability of the pombination of UC325 (E>-8) and total PSA (t-PSA) to distinguish BPH 
and Stages A, IB and C prostate cancer. Figure <6 shows the ability of the qombmahon of UC325^ 
P^Mtio to distinguish betwe^ BIp|ahd stages A, B ai^^^ostate cancer. 
It is apparent that^t^'.epmbinati with either t^§A' or f/T PSA 

pro^SWi »gnific^t r liiiafease in S0j4§itiyity of detection, while -mamtaming a high degree of 
specifiira^ Thus, thfe cp^bination M JJC325 (IL-8) \yitfi other prostate dise^em^rkers, such as 
t-PSA or M PSA ratio, provides a significant advaiice in the detection and differential diagnosis 
of prostate cancer. 
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Table 7 presents the correlation values for the different serum markers. This table clearly 
shows that the UC325 biomarker provides information which is independent of that provided by 
the f/t PSA ratio. 

TABLE 7 

Correlation Values for BPH vs Stages A, B & C (n = 142) 





Diagnosis 


Total PSA 
(ng/ml) 


. Ratio (%) 


UC325 
(pg*ml) 


Age , 


: Clinical 
Stage 


Diagnosis 


1.0000 


0.5647 


-0.1912 


0.2262 


0.1590 


0.3497 


Total PSA 














(ng/ml) 


0.5647 


1.000 


-0.2319 


0.5991 


0.0898 


0.3729 


f/t PSA 














Ratio (%) 


-0.1912 


0.2319 


1.0000 


-0.2142 


0.0641 


-0.4126 


UC325 


• 












(pg/ml) 


0.2262 


0.5991 


0.2142 


ioooo 


0.0881 


0.2486 


Age 


0.1590 


0.0898 


0.0641 


0.0881 


1.0000 


0.1372 


Clinical 














Stage 


0.3497 


0.3729 


-0.4126 


0.2486 


0.1372 


1.0000 



10 ; 

Mr 



Table 8 clearly derhonsttates a relationship between tumor burden and serum UC-325 
gene product measured by IL-S assay. Note that as biopsy-confirmed clinical stage of the cancer 
increases, so dobs thfe IL-8 seiilm marker concentration^ whereas; the s^iae relationship did not 
6ccurwitfr{^ - 



Normal 



TABLES 




N 



UC325 (10 pm/ml Cutofi) 

Negative , : - . Positive a : 



UC325 (15 pg/ml Cutofi) 



Negative 



Positive 



8 

55 
15 



8 (100%) 0(0%) 

41 (75%) 14(25%) 

25^35%) ; :V- 47,165%) 

0(6%) 15(100%) 



2(14%) 



12 



8(100%) 0(0%) 

50(91%) £|9% 

43 (60%) v - 29 (40^ v 

5 (33%) 10 (67%) 

3(21%) 11 (79%) 
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Example 6: 

Identification oj 'Marker s of Metastatic Prostate and Breast Cancer by Use of 
RNA fingerprinting by PCR primed with oligonucleotides of arbitrary sequence. 

5 RNA fingerprinting displays PGR™ amplified cDNA fragments that represent a sample 

of RNA species derived from a population of total cell RNAs. When displayed side by side, 
- comparisons of similarly produced fingerprints representing RNA populations from cells of 
. differing physiologic states identifies mRNA species whose relative abundances vary between 
the examined physiologic states. In this study, RNA fingerprinting identified two cDNA 
10 f fragments derived from mRNA species that had higher steady state abundances in the peripheral 
blood leukocytes of patients with recurrent metastatic prostate; cancer as compared to a group of 
healthy volunteers. 

Eight ml of peripheral blood was collected from healthy volunteers, patients with 
clinically and biopsy confirmed BPH, localized and advanced metastatic prostate cancer, and 
15 j from patients with advanced metastatic breast cancer. Metastatic prostate and breast cancer 
patients that had failed a primary therapy and had evidence of recurrence of disease were 
selected. The metastatic prostate cancer patients had high (> 50 ng/ml) serum concentrations of 
:-^r' ? --PSA.- ; Circulating nucleated peripheral , blood cells were separated from erythrocytes by 
cenfrifugation in Vacutainer CPT 1 ^ tubesi(B<ectpn Dicldnson and Company^ Franklin Lakes, N 
20 v r J). ; ^otal KNA was prepared from isolated nucleated p^ripheoil bipod cells by lysis with RNA . 
Stat*60™ (TelrTest, Inc., Friends\yood, TX), following the instructions provided by the vendor. 
* \ Gpntaminating, genomic DN A Was removed from the total RNAs by digestion with RNase free 
, .*<•., . L JDNasel (GIBCO-BRL, Gaithersburg, MD> For the PCR™ based applications of RNA 
fingerprinting and relative quantitative RT^PGR^ it is absolutely critical that the, total RNA is 
25 completely free of genomic DNA. Typically, 5^0 to 10.0 j*g o^total RNA was digested with 

.X'V^ ^^O^lftainits of RNase free DNasel in 100-200 til of reaction Volume for 20 min at 37°C. 
■\ : v;v . .. v£ Inc., 
v Splpn^ OH) and ethanol precipitated. To confirm th^t the RNA was free of contaminating 

genomic DNA, 500 ng to 1.0 jig of rach DNaseTfri^^^ These 
30 were used as templates for PCR™ using oligonucleotide primers that anneal to exons 3 and 4 of 
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the gene encoding PSA (exon 3: 5' GCCTCAGGCTGGGGCAGCATT 3' SEQ ID NO:22, exon 
4: 5' GGTCACCTTCTGAGGGTGAACTTGC 3' SEQ ID NO:23). These primers anneal to 
opposite strands of genomic DMA that flank the 145 hp intron 3 of the IPSA gene. PGR™ was 
performed at 94°C for 1:1 5 min, followed by 40 cycles of 94°C for 45 sec, 55°C for 45 sec, and 
5 72°C for. 1:15 min, then a final extension of 72°C for 5:00 min; RNA Was considered DNA-free 
if no PGR™ products could be visualized upon gel electrophoresis that co-migrated with the PSA 
gerie positive control of known human genomic DNA. If PSA gene products were observed after 
PCR™, the RNA was redigested with DNasel and analyzed again for contaminating genomic 
DNA. After it was confirmed that the RNAs were free of genomic DNA, 500 hg to 1.0 ng of 

10 RNA was electrophorescd on a 1.2% agarose Tris Acetate EDTA (TAE) to visualize the 
ribbsomal RNAs (Fridell et al. y 1995). Only RNA preparations for which the 28S ribosomal 
RNA could be visualized were selected for further analysis by RNA fingerprinting and relative 
quantitative RT-PCR™. 

RNA fingerprinting with arbitrarily chosen oligonucleotide primers (Welsh et at y 1992) 

15 is conceptually similar to differential display (Liang and Patrdee, 1992), except that 
oligonucleotides of arbitrary sequence are used to prime both strands of cDNA synthesis instead 
of just second strand synthesis, as in differential display. In this investigation, the strategy of 
RNA fingerpriritmg used was similar to that described in Ralph et ah (1993) except that 
oligonucleotide primers used were composed of two discrete domains. The 5' domain of these 

20 oligonucleotides consisted of ten nucleotides that complemented sequences from either the T7 
promoter the Ml 3 reverse sequencing primer. The 3' domains of these oligonucleotides were 

- ^-n^r in a 
perihiscuous fashidn (^pez-Nieto and Nigairi, 1996). These ohgoAiicl^tideis Were then used in 

v at sequential ^^h^se; st^tfegy -thit optimizes the amount of mRNA cdiirf)lexity that can be 
25 surveyed with lifted riumhers of prrmers and starting RNA. ^ Car& was taken to ensure that the 
two oligonucleotides used to produce any single fingerprint did not share sequence similarity in 

— ^ sequence 
■ ^ si ^&&^inij >tfe^t haVie specifi c fiincti oris "Withiii this experimental design, the ohgpntieleotides are 

^riniscuous raflier thain tiuly arbitt^ in nature. ^ 
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Two RNA pools were fingeiprinted. These two pools were each created by combining 
/equal amounts of peripheral blood total RNA from five individuals. One pool was constructed 
by pooling RNA from five healthy individuals while the other pool was derived from five 
individuals with recurring metastatic prostate cancer. Using the pooled RNAsas templates, first 
5 strand cDNA synthesis was primed by annealing One of the permiscuous oligonucleotide primers 
f Mo the piwlctf llMAs at low stringency. All fingerpriritihg studies were performed in duplicate 
using diflcrcnt initial concentrations of template RNA. The replicate fingerprints were initiated 
by usiny cither 125 ng or 250 ng of RNA as template during first strand cDNA synthesis. 
^ReactioiV conditions for first strand cDNA synthesis were 250 units of Superscript II™ 
10 (GlBC() I1RI/ Gaithcrsburg* MD) in IX supplier' s reaction buffer (25 mM Tris-HCl [pH=8.3fj/ 
37 5 mM KC1; 3 0 mM MgGl 2 )/ lO mM DTT, ^00 jtM e&bh dNTP, and 2.0 jiM permiscuous 
oligonucleotide in a 40 pi volume. The latter was incubated for 1 h at 37°C. Following first 
strand cDM A synthesis, the RNA was digested Avith RNase M and heat inactivated at 70°C as 
directed by the supplier. 

15 One-tenth (4:0 pA) of the first striaM cDNA reaction mixture was used in the 

fingerprinting PGR™ reaction. As many as ten different RNA fingerprints were generated from 
each first strand cDNA reaction. To the first strand cDNA, 36 f*l of a PGR™ iriik -solution was 
addkL The latter ccurtaiiied 50 inM f ns-Ct e>H==8:3), 5b mM KCl, 200 plM each dNTP, 1.0/|^Ci 
of a 33 P-dCTP, 2.0 ^M second permiscuous oligonuclebtide "and 10 unit of recombinant Taq, 

20 E)NX polymerase (G I BCO-BRL, Gaithersbiirg; MD). Note that the concentration of the first 
; ^ less that '200 nNf. PGfe™ fnigerprinting \VaS petfbfihed Hvith one 

recycles 6^4 6 fc for 45 sec, 48^ thin. A filial ^tension step of 

7i 6 C fo# 5:00 was performed. Next, 4.0 pl of thd finial PCR*^ products were mi^ed with 6.0 \xl 
0 6f sequencing formamide dye solution and toatiii^d by heating to 75°C for 5:00 mm. 
J A^f)ibxirnat^ly XS \x\ of thb'denatured PCR 1 ^ iMiicte in foitnamide dye was electtophoresed 
4 £elv PGR™ 

* ^ -' r -- K \ V' ■ ..^ y 

'•^.^^V:^^''.-;^^^^ ^jj^eWtially ^^ring PCF^ ^ fragments t^^fifed in these 

30 studies that are the subjects of this report were termed UC331 and UC332. UC331 was 
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identified in a study in which the first permiscuous primer used in the reverse transcription 
reaction had the sequence 5' ACGACTCACTATAAGCAGGA 3' (SEQ ID NO:24). The second 
permiscuous primer that was used in the PGR™ fingerprinting reaction that identified UC331 
was 5' AACAGCTATGACCATCGTGG 3' (SEQ ID NO:25). UC332 was identified in a study 
5 in which the first permiscuous, primer used in the reverse transcription reaction had the sequence 
5' ACGACTCACTATGTGGAGAA 3' (SEQ ID NO:26). The second permiscuous primer that 
W£S used in the PCR™ fingerprinting reaction that identified UC332 was 5' 
AACAGCTATG ACCCTGAGGA 3' (SEQ ID NO:27). After autoradiography, bahds that 
appeared differentially in fingerprinting reactions on the pooled total RNAs described above 
10 were cut out of the gels and reamplified by PCR™. The reamplified PCR™ products were 
directly sequenced using the Sequenase™ reagent system (Amersham Life Sciences, Inc., 
Arlington Heights, IL.). 

The sequences of UC331 and UC332 were compared to those deposited in release 101 of 
GenBank (July 1997) using the Lasergene™ software package (DNAstar, Inc., Madison, WI). 
15, The DNA sequence of these cDNA fragments, when compared to the GenBank database, 
revealed that the mRNAs, from which these cDNA fragments were derived, were previously 
uncharac^erized. Neither UC331 nor UC332 are genes whose products have been previously 
characterized as being significant in any physiological pathway, both UC331 and UC332 match 
, ^quences on the GenBank 
20 , In the case of UC33 1 , these matches are confined to ESTs. UC331 was identical within 

the limits of sequencing accuracy to several human EST sequences. The human EST sequences 
4 ^ Pppld b^ a3»?inbled intaa vhtual contig th^t predicts the sequence 

- ■ Q^a^^r-n^A. The ends of the UC33 1 coiitig were then used to requery the E$T data base 
: ^ er ^y roore ESTs ^vyere identified that extended the contig;. This process wa? continued until 
2 ?^v>:. UC331 ^contig predicted a mRNA with an ORF and a poly-A tail. A description of the 
i r:T Jiuman ESTs were ^sed tp ^on^ct the UC33 1 pontig are provided in Table 9. The 
J&zi ?^^?^ Q ^? UC331^ <x>ntig and the ORF was identified at its 5' end A signific^t feature of 
this contig is that the ORF extends all the way to its 5' end; This v indicates that the UC33 1 
m^A extends further 5' than is indicated by the contig constructed from the EST database. 
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i'i-^ TABLE 9 .... 
UC331 EST Distribution 
Human * 



GB Accession Number 



Tissue 



Library' 



AA403120 v • 

nn^Uu 1 1— \j ...... 

v'. • 


Tntal Fptus 




Tntal Fptus 


AA1 21473 r. 


Pregnant Uterus 


AA1 21282 


Pregnant Uterus 




Plarpnta 


R22146' 


Placenta 


R30954; >clf. 


Placenta 




• IdLcllld 


R32887 h ^ 


Placenta 


R31390 h V 


Placenta 


nb/ouD 


riacenta 


R67807 9 


Placenta 


AA385620 ^ 


Thyroid 


W37985 ; ,. 


Parathyroid Tumor 


W37986 


Parathyroid Tumor 


AA38040lJ^ 


Cell line (Supt) 


AA18247lJg: 


Cell line (Held) 


AA18153(ff^ v 


Cell Kne(HeLa) 



" W31231 
N22701 

W31175 k «h . 
M34446 
N34538 
.( N36424 
N3652l*'V 
N42854~*tt®5S 

N44299 r Bm 



- v '. f ."J 



Senescent Fibroblasts r ^ ^ r 



Normal Melanocyte 
Normal Melanocyte 
Normal Melanocyte 

Normal Metanocyte ; ^ . ; fr^^L-. 
Normal Melanocyte 
Normal Melanocyte 
Normal Melanocyte 
Normal Melanocyte 



Soares > 
Soares 

Soares ';-'^''r,\y 
Soares 

v v. ' . ..." 

Soares l ^ 

Soares 

Soares 

Soares ^ 

- - • .■ . .-. 

Soares 
Soares 
Soares 
Soares ^? 

TIGR * : ; 

Soares ; ■ • v ^ ^ 
Soares ^ \ 

TIGR ^-;.;yC; 
Stratagene "V 

(IMAGE)3^f5 :; 
Stratagene 
(IMAGE) J 

Soares '• 

Soares 

Soares - r 4'^ 
Soares 

Soares, r > " 
Soares 

Soares ^ 
Soares ''^fX^^^ 
Soares - 
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GB Accession Number 




W56398 


Normal Mplannrvfrp . 


N66813 


Normal Melanocyte 


AA379996 


Skin Tumor 


AA370040 


Prostate Gland 


AA369851 


Prostate Gland 


H08822 4 


Brain (Whole infant) 


H08905* V: 


Brain (Whole infant) 


HI 9533 


Brain (Whole Adult) 


H21379 


Brain (Whole Adult) 


H21421* . 


Brain (Whole Adult) 


H24360* 


Brain (Whole Adult) 


nZbl/b 


Brain (Whole Adult) 


nooDoiJ . ■. 


brain (wnoie Adult) 


noo/o i 


Prom /\A/hrvlo Arlitltl 

orain iwnoie aouiij 


d 

H39147 ; 


brain (Whole Adult) 


d 

H39148 


Brain (Whole Adult) 


H45092 c 


Brain (Whole Adult) 


H45054 t 


Brain (Whole Aflult) 


U/IQQOQ 


Brain (Whole Adult) 


HRflAfTC ; v T ; c 


brain (wnoie aquii; 


H51403 2 


Rrain (Whnlp Arlnlt) 


nj i ****** /. _ 


brain iwnoie aquii; 


H528ffS^h 


Brain (Whole Adult) 


H52774Lv. 


Brain (Whole Adult) 


R85542 : ; ^ , ; 


Brain (Whole Adult) 


R84652 


Brain (Whole Adult) 


AA324855 


Rrain fRprphpllitml 

Ul 011 1 JuCI CUCIIUIII/ 


AA317211 


• i. 

Retina 


AA37191kv i 


Pituitary Gland 


AA302113I % ; 


Endothelial Cells, Aor 


, AA247643 i;pfi 


Fetal Heart 


W6Q049 ; 


Fetal Heart 


W61359 < c ; 


Fetal Heart 



Tissue 



Library 



• ; * . ... 



Soares 
Soares 
TIGR 



ta : } >7/.; ; • % 'A-V r 



TIGR 
TIGR 

Soares 

Soares 

Soares^ 

Soares 

Soares 

Soares - 
Soares - 
Soares 
Soares 

Soares 

Spares 

Soares 
Soares 
Soares 
Soares 
Soares 
Soares 
Soares 

Soares, r 
Soares 
Soares 
TIGR , 
TIGR 



l ■ 



TIGR 
TIGR 



U. Toronto j/S^^ 
Soares 



Soares 



Rwc^nrfrv^wn 
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6B Accession Number 




Tissue 


Library 


AA243511 


B-Cells 




Soares 


AA234769 


Pooled; fetal heart, melanocytes, pregnant uterus 


Soares 


A A 158239 
AA 150565 ^ 
AA 160836 I 


Pancreas 
Pancreas 
Pancreas 




Stratagene 
(IMAGE) 

^ Strataoeife^ ^ ; 
(IMA^E) V- V 
Stratagene 
(IMAGE) I; : 


H73822 
N58180 M\V 
W04414 ; 
N94254 

N69644 

T72755 


Fetal Liver Spleen 
Fetal Liver Spleen 
Fetal Liver Spleen 
Fetal Liver Spleen 
Fetal Liver Spleen 
Fetal Liver Spleen 
Fetal Liver Spleen 
Fi?t3l Liver Spleen 




Soares 
Soares 

Soares ^ • ' 
Soares . v : 
ooares 
Soares - 

Soares ^ ' 


T53976 


Pooled Fetal Spleens 


ri • , •- i i ■.. 


Soares : v 


N76701 
N90814; 

N59233 

N532B7iv 

N51545 


Multiple Sclerosis 
Multiple Sclerosis 
Multiple Sclerosis 
Multiple Sclerosis 
Multiple Sclerosis 
Multiple Sclerosis 




Soares M 
Soares : : ^ 

Soares 

Soares 


F22624 


Skeletal Muscle. - 




CRIBMltaly) 



NoterPaired superscripts indicate opposite ends of the same cDNA clone. 

% " i ^ UC331 ^% \v^us^ to q*^^ database inqny mouse 

■* region 

>::^; y f * * similarity with each, other that virtual 
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contig in a process that was identical to that used to create the human contig. The mouse UC331 
virtual contig was also observed to have an ORF at its 5' end and a poly- A tail at its 3 r end. A 
description of the mouse ESTs that were used to construct this contig are provided in Table 10. 

5 TABLE 10 

Mouse 



GB Accession Number 


Tissue 


library 


Clone # y 


AA0274ft7 


riduvllicf 


OUaiuh 




AA023708 


Placenta 


Soares 


45698415') 


AA023154 . ; 


Placenta 


Soares 


456027 (5') 


AA024303 


Placenta 


Soares 


458313 (5') 


W35948- : W - 


Total Fetus 


Spares 


350258(5') 


W 1 1 Do 1 


I o tai retus 


^Spares 


318665 (5') 


W36820 


Total Fetus 


Spares 


336707(5') 


AA002492 


Mouse Embryo 


Spares 


426498 (5*) 


AA097370 


Mouse Embryo 


Soares 


493073(5') - 


AA014313 


Mouse Embryo 


Soares 


468491(5') 


AA450512 


Bedding ton embryonic region 


IMAGE 


865186(50* 


AA408179 1 


Embryo Fetoplacental Cone 


Ko 


C0025F09(3') 


AA408261 1 u 


Embryo Fetoplacental Cone 


Ko 


C0025F09(5') 


AA1 17174 


T-cells ^ - > 


Stratagem 


i 558134(5'): 


AA1 19346 


Thymus 


^Sares 


573567(51 


AA183195 ~ 


Lymph Node 


So^r&s 


636222(50 


AA122933' ' 


Kidney 




579415(57- 


AA423613 


Mammary Gland 


Soares 

■ ■ ;■■]} 


832219(57 











Note:Paired superscripts indicate opposite ends of the same cDNA clone. 

When the MegAlign™ program of the Lasergene™ DNA analysis software package 

wutW, vitE*^^^ coritigs were 

n%i$& ft^spreiUcied^ represent mlEhlA species that ; wer^higKiy sirtiilar an^h^ 

x their lengths. This sirhilarity was most striking in the region comprising the putative ORFs. 

*^ : Withm th^ OllFs die hrouse and human e^ are 89% iden^eal. In the 

■ ' ' . ' ' : ■ ' ' • ■ ft . ' 
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predicted 3' untranslated regions of the tvv6 contigs, the DNA sequence similarity falls to 73% 
with several small deletions and insertions. This higher degree of sequence similarity in the 
putative ORFs as compared to the proposed 3' untranslated region is interpreted as evidence that 
the ORFs encode proteins on which natural selection constrains amino acid sequence divergence* 
5 Like the human JUC33 1 cbntig, the mouse eontig also encodes ai putative QRF thiat extends all the 
. way to its 5' end v This provides additional support for the contention that the UC331 mRNA 
contains more sequences at its 5' end than are represented by the EST based contigs presented 
.here. *■/ 

The ORF^ of the mouse land human UC33 1 contigs Were conceptually translated and the 

10 amino acid sequences were compared: The amino acid sequence of the human UC33 1 ORF was 
used to query the Swiss, PIR and Translation release 101 vising the Lasergehe™ software 
package. For the 157 amino, acids for Which* this comparison is possible, the mouse and human 
sequences are collinear and identical at 15 1 positions (96%) with five of the six differences being 
conservative substitutions. This putative protein domain is highly acidic with 26 acidic and 17 

15 basic amino acids. There were ajsp 48 hydrophobic and 43 pdlar amino acids predicted. When 
either the predicted mouse or human UC331 amino acid sequences was compared to amino acid 
sequences in the rptibliC; protein, sequent data bases, no significant matches were found to any 
previously characterized vertebrate proteins. However, a significant match was observed to a 
putative protein, termed ZK353.1 (PIR Accession number the genome of 

20 the iiematode, Caenorhqbditis elegans. Thei mammalian a is similar and 

collinear with the , C-tennip^- 157 ammo acids ©f the ptitatiVe G elegans p&tein. Like the 
mammalian UC331 ^nuno a^ acid seqtience of the 

ZK353.1 is also highly acidic with 3 1 . acidic and only 20^ba(sie amino afcids. Over the 263 amino 
acids for which a comparison can be made the *ZK353il amino acid sequence is identical to the 

25 human or mouse sequence at 84 (41%) positions with many of the diffetericesf ? reorienting 



., conservative substitutions. , ~ / ; - ^ , ; ' 1 7';* 



The putative C elegan? protein, Its existence 

is predicted from the C. elegans genome sequencing effort (Stilstoii et ah, 1992). The 
polypeptide sequence for ZJO^J^S a ^OQceptulit translation off an a&a on the C: elegans - 
30 chromosome III (GB accession number CELZK35a>. Therpreaicted sequence for ZK353.1 is 



lo 



5 
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548 amino acids long and includes an additional T71 w 

domain with similarity to the oredic. / ofthe 
anty to t he predacted ammo acid sequence of UC331 If urn, • 

ma^ahan^^ lstfce 

entire length, it culd be expected that the ORP of UC331 ^ ^ 
1 100 nucleotides S< of the sequence inSEQlT^ " ^ 

extendsfurmerS'manisaccountedforin* While it is likely that the UC331 ORF 

*u is accounted for m the virtual mouse and human T Fnu 

m <** &™ hmKm f0ly . A plusRNA ^ Wow 

n*NA extends oniy about 350 nnc.eo.ides tahe, 5. This T ' man ^ 

Pane, of 'inRNA processing ^ c ^ " ^ " iMe ' Pretfa8 

fte mammalian and nenu.tode roRMA ! ^ ° H * d,Cate ** 

To confirm that the human UC331 vin.,„i ~ - 

-.^^^^ 

cDNA fragments prndicied ,0 *e eonnnnons ^ ™ R ™^^e 

■xJZSzszz rr~ bu,was **— * * ° f » 

length cDNA,with a GenBank accession number of D87451. 



' " ! ^S^TT" byRNA - ^ * : — 

<<^ means. „ * 

,, nacai tnat the differential expression of all mRNAs identifi^ k ' Wj. i^-' 
, be confirmed as such bv an i„,W v, « identified by RNA fingerprinting 

:««>^.;-- y • sucn oy an independent methodology- To inAw^ ^ w 

, ( expression of UC331 in th. • u . . independently confirm the differential 

" * ' * j ui the peripheral blood of nati-ntc 

compared to the nerl.K . k, / with recurrent metastatic cancer 

f ^ to peripheral, blood 0 f healthy volunteers twn A«k ' * ^ 
quantitative RT-PCR™ were - J * f ° r 3 re,atiVe 

Pools ofcD N A co^r7 K ^ first format 

~ ^ ^ ^ ^ various individuals 

^ similar physiologic states: In this study a C DNA =■ v 

yc-lunteers was compared to a ™,i . Pool representing 8 healthy 

.-..«<<•• ^ ^nipared to a j>ool represettting lamdividiial^^ii ; , •*>.-j- 4 *t<« .• 

cancer A*hi«1«^i - maiviauals with recurrent metastatic prostate 

metastatic breast cancer was also 
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Examined. The inclusion of the breast c^cer patient Samples in this study was made to 
determine if the mRNAs examined were being differentially regulated in the iiriihlune system in a 
response that was specific for prostate cancer or if th6 response was more general to metastatic 
cancer in general. Using these pools of cDNA as templjates,; triplicate PCR™ was performed. 
5 Each 6f the three replicates were terminated at a different cycle number of PGR™. This format 
•*V of relative quantitative RT-PCR™ insures that the results taken for relative quantitation represent 
• the PGks 174 when they are in the log linear poitioiis of their amplification curves where such 
quantitation is most accurate. 

Approximately 1.5-5.0 jig of E)NA-ffee total RNA from the peripheral blood of healthy 
10 ^volunteers or patients with either metastatic prostate 6f breast cancer were converted into first 
stratod cDNA using the SuperScripf™ PreampUficatiori Syhem for First Strand SDN A Synthesis 
;{GIBC;6-BRL, Cat# 18089-61 1) followinjg ihe directions provided by the supplier. These 
cDNAs were then normalized to contain equal concentrations of amplifiable cDNA by PCR™ 
amplification of pectin cD^NA using the primers 5' GGAGCTGCCTGACGGCC AGGTCATC 3' 
15 (SEQ ID n6:28) mid 5^ GAi^CAf^ 3* (SEQ ID NO:9). A 

typical PCR™ program wOuld be 94°C for 1:15 min, followed by 22 cycles of 94°C for 45 sec, 
55 P C for 45:seb and 72 b C for 1:15 miii. This was followed by final extension of 72°C for 5:00 
min. PCR™ products were visualized by gel electrophoresis through 1.5% agarose TAB gels 
Stained with ethidium bromide. Images of llie gels were captiired, digiti!z£d and analyzed using 
20 the IS-I OOO Digital Imaging System (Alpha IftnbtecfrC^ of the cDNAS 

were adjusted by adding various amounts 6f water to create cDNA stocks that contained equal 
concentrations of amplifiable pi-actin ct)NA. Typically, the cDMA derived from the reverse 
V fi trahseripti0n bf 5 0 jig of RNA resulted in ^noUp ri^oirmalized fcDNA to perform 50-200 
•' ^ ;> 3R^Pd^ reactions^ ^..ul-y^-/- • ■•■ — " $. l -rr--- -j* . 

25 • Equal amounts of the normalized cDNA MOgk fi^M Ihdiinduals having the same disease 

• state were pooled. Pools of cDNAs from healthy VoluiUeers, patients with metastatic prostate 
- eaiicer and metastatic breast caiicer \vere pitoducfetL Thes<b pools w^re theh examined by PCR™ 
for p-actin to determine that they contained eijfcial ajftbunt^ 6Canaplifiable cDNA* > 

- Tb demonstrate that all observations ^ of the PCR™ 

30* amplification dtirVe, a series of PCR™ rdifctioris ^ performed ott 
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each cDNA pool for each gene {primer pair) examined. Display of the PCR™ products bh 
electrpphoretic gels and analysis with the IS 1000 Digital Imaging System illustrates that the 
mass of the PCR™ products is increased exponentially with increasing cycle number, confirming 
that the observed results are in the log-linear portion of the PCR™ amplification curve, 
5 Relative quantitative RT-PCR™ showing near equal amounts of amplifiable P-actiri 

cDNA in three pools cDNA. Pools of normalized cDN As were constructed from peripheral 
blood RN As from eight healthy volunteers, ten individuals with recurrent metastatic prostate 
cancer, or ten individuals with recurrent metastatic breast cancer. .Three separate PCR™ 
reactions were performed on each pool of cDNA. PCR™ was terminated at differing cycle 

1 0 numbers (cycle 22, cycle 24, and cyple 26), and the products were visualized by electrophoreses 
and ethidiumn bromide staining. Images were captured and quantitated using a digital image 
analysis system.: At all three sygle numbers examined, there are relatively similar band 
intensities representing the three cDNA pqols and increasing band intensity with increasing cycle 
number, verifying that the observations are being made in the log linear range of the 

15 amplification curves. Similar band intensities indicate similar relative concentrations 1 of P-actin 
mRNA in the RNAs from individuals from which these cDNA pools were constructed. 

The oligo^ucleptides used } in thg relative quantitative RT-PCR™; studies that 
independently confirmed the differential expression of UG331 were designed from the sequence 
in the human UC33 1 virtual contig. These UC331 specific oligonucleotides had the sequences of 

20 l 5; CrGGCCTACGGAAGATACGACAC 3' (SEQ JD , x ?fp:31) and 5' 
ACAATCCGGAGQCATGAQAAACT 3' (SEQ ID lsfO:32). These oligonucleotides direct the 
:-o-;<, amplification of a 27^ nucleotide ; long PpR™ product ^ UC33L The 

,.}'?}£■:• '■' pUgonucleotides us^^ in the relative quantitative RT-PCR7^ studies that independently 
confirmed the differential expression of UC332 were designed using the sequences of the cDNA 

25; with the GenBank ac^essi^ri number D87451. These UC332 specific pligoj^cleotides had the 
^p^es - ; AGQCCGQGCCT 3' (SEQ ID Np:33) and 5* 

; f ^ {$MQ JE> ^034fc These oUge^d^^des/ direct the 

r • ... .... .... ..... ..... . ... 

amplification of a HP nujcle^tide long PCJR™ product that is ^et^fic J^ pC332. < - 

Th^fesults for relative levels^ those observed 

30 ?, v ^^^ligo^ were used to direct ^CR^ a^ (FIG* 
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7). At 25 cycles of PCR™, clear bands are visible in the lanes representing the pools of cDNA 
from peripheral blood of patients with either metastatic breast or prostate cancer. In the lane 
representing the peripheral blood of healthy volunteers, only a very faint band is jir&sent. At 28 
cycles of PCR™, the band intensities representing all three "pools are brighter than they were at 
5 ?25 cycles, bui the retail ve increase in intensity of the bands representing the metastatic cancer 
patient pools compared to the healthy volunteers remains the saihe as was observed at 25 cycles 
Wf' PCR™. This indicates that these observations are being made in the log linear range of the 
PCR™ amplification curves. At 3 1 cycles of PGR™, there is still an increase in the intensity of 
^ the bands representing the pools of metastatic cancer patients compared to the pool representing 
10 ^the healthy volunteers' but a quantitative analysis of these bands indicates that the PCRs™ have 
"left the log linear range of their amplification curves. Quantitation of the data for 25 and 28 
cycles of PCR™ independently confirms that UC33 1 mJRNA is differentially regulated and is 
roughly seven fold more abundant in the peripheral blood leukocytes of the average patient with 
either recurrent metastatic prostate Cancer or breast cancer than in the peripheral blood 
15 leukocytes of healthy volunteers. 

The second format of relative qjuahtitative RT-PCR™ used to examine the differential 
expression of OC33 1 exaniined the relative aibuhdance of UC33 1 mRNA in the peripheral blood 
of Wealthy individuals or individuals with recurrent metastatic cancer. The individuals examined 
in this study wer£ the same as thoste whose cDNAs were conibiri£d to construct the pools 
20 exaimiifed M dfescribe<i Sbove. Using the i^orinatioii obtained from the pooled cDlSfA study to 
predict at what %CR™! cycle riunibc^s relative ' (quantitative RT-ECR** 1 would be most 
^ UC331 
0:1 " "^^^□tettjNiS^^ Ip^&^eni^ltt - jpfc^^^^iral- blood te^ofe^^'^i^^'^^ for 22 ^^^^'J^-^^dsuis 
• -^miBfed Contend of am^lifiable pectin ctfNA. ^^^t^^a^Bxcnees 

25 rr: i^i^ p-actitf bhfid ii^efiraty oba^ved in thi^ ^tudy are pi^b^^ dtad tb the internal Variataoiit inheteiit 

that 

^ei^6al itpifcates of a pectin PGR™ can be exp^tedr to vary in fhd iiitehsify df product bands 

' 0C331 cDlSFA^^ 

30 • ^ti^iaiMi Wdiri IUS1A isolated froml^e ^rij^y Ifldbd of eight h^thy Vbl^^ 1% 
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ten individuals with recurrent methstatic prostate cancer (group P), or ten individuals with 
recurrent metastatic breast cancer (group B). PGR™ was for 30 cycles. As yvas seen in the study 
using the pooled eDNAs, the results of the relative quantitative RT-PCR™ for UC331 using 
cDNA from individuals contrasts sharply with that qbserved for pectin. The intensity of the 
5 band representing the abundance of the UC331 mRNA in peripheral blood leukocytes was 
greater for all of the patients with either metastatic prostate or breast cancer as compared to the 
intensity of the UC331 band representing the mRNA level in the peripheral blood leukocytes of 
healthy volunteers- Therefore, the elevated UC33 1 mRNA levels indicated by the relative 
quantitative RT-PCR™ results using the pooled cDNA templates was caused by an elevated 

10 mRNA level in all individuals comprising the pools and not from a subset of individuals with 
very high elevations in UC331 mRNA levels. This study is a second independent confirmation 
of the differential expression of the UG33I mRNA. 

As is indicated by the wide distribution of tissues from which the ESTs used to assemble 
the UC331 contigs (Table 9), UC331 is widely expressed in many tissue and cell types. 

15 However, because most of ESTs comprising UC331 are from normalized libraries, little, 
information can be gained from this data on the relative abundance of the UC331 mRNA in 
different tissues. Also, while the extension of the QRFs of the mouse and human UC331 contigs 
all the way to their 5' ends and the similarity of mammalian UC331 mRNAs to a much larger 
putative C elegans mRNA both predict that the mammalian UC331 mRNA extends even further 

20 5=', the exact size of the UC33 1 mRNA was unknown. To address alj of these issues, ^Northern 
blot of poly-A plus RNA from eight different human tissues was probed with the ,850 nucleotide 
long RT^ECR™ prodiuct de§cri^ labeled with 32 P. Approximately 2.Q ixg pf pply-A plus 

: RNA from spleen, thymus,^ small intestine, colon, and peripheral blood 

leukocytes were loaded in each lane. TJC33 1 mRNA is expressed in all eigjit human tissue and 

25. P.^;lyp^s ? -(Sj^/^aQdavds- indicate a message size of approximately ■ 1 .75 ^k£. Interestingly, 
; ? v ^ biit is highly express;ed . 1A the thymus, 

v>y . ^e^QiistratiAg a difTeren^ m expression between cells of different developmental steges in the 
immune system. UC331 is most abundantly expressed in thejte^ps^ about 

, % J- IJS kb which indicated that the mRNA only extends ^bout 350 nucleotides further 5' than is 

30t ^ HD* NQ2^ of the 
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virtual coritig is shown in SEQ ID NG:3d. Cleairly, the putative C. elegaris mRNA extends much 
more 5' than do the mammdian iriRNA species; 

The other jgehe identified as being differentially regulated in this RNA fingerprinting 
sttidy was UC332. UC332 was analyzed in much the same way as UC331 was. When the 
5 sequence of the cDNA fragment from the RNA fingefprintirig gel representing UC332 was used 
io query GenBank, no ESTs were identified; The sequence of the UC332 cDNA fragment did; 
• ■ ■ however, i^entily a sequence of a full lerigth cDNA; KA000262 (GB:accession number 
1587451). The sequence of KA006262, (hereafter referred to interchangeably with the name, 
r UG332) was determined as part of a project to examine previously unidentified mRNAs 
10 expressed in the bone marrow myeloblast cell liiie; RG-1 (Nagase et aL 9 1996), This mRNA 
contains' an ORF encoding a putative protein with 76i ^mo acid sequence. Perhaps the most 
striking feature bf this polypeptide sequence is the appearahce of a C3HG4 RING zinc finger or 
; RING finger motif (Freerriont, 1993) located between amino acids 175 and 216. The RING 
finger domain binds two zihc ions in a conserved structure that has been resolved (Barlow et al, 
15 1994). RING finger domains have beehf ideiitified in dozens of proteins derived froiri eukaiyotes 
as diverse as yeasts, flies, birds, nematodes and humans. In most of these cases, the RING finger 
containing proteins have been shown to be essential for some important biological process 
although the these processes vary considerably one from another; Among; these mammalian 
encoded RING fihger proteins are Several gerie£ implicated in th6 ^ntog^hy of cancer including 
20 the^re/ vir&l oncogerie (Takahashi e/ a/. , T988) ^d ^riii-1 /a gene wftose product collaborates 
with myc induced transformation (Haupt et al y 1991). Tlie BRGA-1 tumor suppressor gene 
involved in hereditary breast arid ovarian <^ter susceptibility conia a RING fiiigeif domain 
^ CMila io/., 1 994), arid MAT-I , a novel 36 kDa RING finger prbteiii, is fequired for the 
^ " assembly of erizyrftatically active CDK7- cyciin-H coniptekes (Tassan et al. , 1 995), A 
25' ^ comparison of the RING fiiiger domains of UG332 and various r^i^eseritetiVe ? meinWfs of this 

indicates the 

r ^ fingfc^dtfih^^ in the ^iiffit>r suppressor 

m s g&ie, fiRCA 1 , arid the; T cell repressor <>f t^ahscriptiori protein, *ipt- 1 However, BRGA 1 and 
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Proteins with RING finger motifs exhibit heterogeneity in their subcelluar localizations. 
Some, that are important regulators of differential gene regulation, localize to the cell nucleus. 
When the amino acid sequence of UC332 was scanned for evidence of subcellular localization, 
two domains were identified that contained sequences for putative nuclear localization signals 
(NLS). NLS are highly basic, stretches of six are more amino acids of which at least four are 
basic that tend to be flanked by acidic amino acids and/or prolines (Boulikas, 1994). Both of the 
putative NLS in IJC332 longer and more basic than the minimum requirements for the, consensus 
NLS, motif. The first of these putative NLS motifs occurs between amino acid 548 and 567. 
Within this domain, 13 of 19 amino acids are basic. In fact, this domain could be viewed as two 
NLS in tandem separated by two glutamic acid; residues, If divided this way, the first NLS 
domain would have % of eleven positions as b^sic amino acids while the second motif would 
have 5 of 6 amino acids being basic. The second NLS motif in UC332 is located near the 
C-terminal end between positions 739 and 750 in the amino acid sequence. This domain has 8 of 
12 amino acids as basic residues, with a core of 5 consecutive lysines and arginines. The 
presence of these putative NLS in the amino acid sequence of UC332 suggest the possibility that 
UC332 plays an important role in regulating the expression of other genes. Finally, the amino 
acid sequence of UC332 lacks a signal sequence for cellular export or, an obvious hydrophobic 
transmembrane domains. • t 

. To independently verily that UC332 mRNA is more abundant in the peripheral blood 

leukocytes of patients with recurrent metastatic cancer as compared to the peripheral blood x 

^. > ...... .. , , - . • 

leiAopytes of healthy volunteers, ^relative quantitative RX^PCRTW was performed using the same 
cDNAs and formats wp^used to inye^gate the differ of UC331 A relative 

quantitative RT^PCR^^dy Rising UC332 specific oligonucleotide primers and cI>N^ pools as 
,^mplat^ ;w^ ^nduct^, . At 25 and 28 cycles of PGR™, the amplified DNA hand representing 
titoe^ is stained mote intensely for. those reactions that 

iK»ed cDN^ template pools constructed from the peripheral bipod J^^^ 
-metastatic .prostate, and breast cancer paints ^ : ;Cpmp^ed to a similar pool cpnsto 
RNA from healthy volunteers, images using Jhe IS T I00Q Digital Imaging 

System (Alpha lnnotech, Inc.) indicates th^^ 
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the peripheral blood leukocytes of metastatic cancer patients compared to healthy volunteers. At 

31 cycles of PCR™, the reactions have left the log linear range of their amplification curves. 

In a second relative quantitative RT-PCR™ study using UC332 specific oligonucleotide 

prirners, peripheral blobd leukocyte cDNA from the individuals that comprised the pools from 

5 : ' the peripheral blood of eight healthy volunteers, ten individuals with recurrent metastatic prostate 

^ " ^ cancer, or ten individuals with recurrent metastatic breast cancer were examined separately. 

PCR™ was for 26 cycles; Hie results of this study are similar to those obtained when the pooled 

dDNAs were used as PCR™ templates. All of the canter patiferits had higher levels of UC332 

mRNA in their peripheral blood leukocytes than did any of the healthy Volunteers. 

10 In this study, the inventors showed that UC332, encoding a RING finger protein, is up 

regulated in the peripheral blood leukocytes of patients with either recurrent metastatic breast or 

prostate cancer. From the literature, RING finger proteins have been shown to participate in the 

regulation of several important lymphocytic processes (Patarca et al y 1988; Fridell et at , 1995; 

Takeuchi et al., 1996; van Arsdale et ah ? 1997; Nakano et al y 1996). The observed differential 

15 regulation of the RING protein encoding mRNA, UC332, in the immune response of patients 
( - # ""1 m 

with metastatic breast or prostate cancer, strongly suggests that UC332 participates in regulating 

.this immune response. 

All of the compositions and methods disclosed and claimed herein may be made and 
executed without undue experimentation in light of the present disclosure. While the compositions 
20 and methods of this disclosure have been described in terms of preferred embodiments, it is 
apparent that variations may be applied to the composition, methods and in the steps or in the 
sequence of steps of the method described herein without departing from the concept, spirit and 
scope of the invention. 

More specifically, it is apparent that certain agents which are both chemically and 
25 physiologically related may be substituted for the agents described herein while the same or similar 
results would be achieved. All such similar substitutes and modifications apparent to those dolled 
in the art are deemed to be within the spirit, scope and concept of the disclosure as defined by the 
appended claims. . • 

UC325-1 is derived from the IL-8 gene (Genebank Accession #M28130). UC325-1 and 
30 UC325-2, an alternatively spliced form that includes the third intron of the IL-8 primary transcript, 
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are transcribed from the IL-8 gene. Our definition : - ? of IL-8 gene products means all mRNAs 
transcribed from the IL-8 gene, the polypeptides encoded by those mRNAs and their post- 



Those practiced in the art will realize that there exists naturally occurring genetic 
5 variation between individuals. As a result, some individuals may synthesize .IL-8 gene products 
that differ from those described by the sequences entailed in the r Genebank number listed above. 
We include in our definition of IL-8, those products encoded by IL-8 genes that vary in sequence 
from tho.w described above. Those practiced in the art will realize that modest variations in DNA 
sequence will. not .significantly obscure the identity of a gene product as being derived from the 
10 IL-8£ene. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: ^ : 

|i) APPLICANT: . V. 

(A) NAME i UROCOR, Inc. </ . 

(B) STREET: 800 Research Parkway 

(C) CITY: Oklahoma City ,.. ; ; i ; :•:>" 

(D) STATE: Oklahoma 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 73104 

(ii) TITLE OF INVENTION: DIAGNOSIS OF DISEASE STATE USING mRNA 
PROFILES 

(iii) NUMBER OF SEQUENCES: 34 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER : IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0 , Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS J 

(A) LENGTH: 253 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GGCAGGGGCT TGTGACTCTA AGATGGCTTC ATTCACATGC CTAGGGCCTC AGTAGG ATGA 60 

CTGGCATGGC CCTGGAAAAC TGCGAAGTCT TCTCTCTGTG OUUVCTTTCA CCTGGACTTT 120 

TTATATGATT CTGGAAGTAT TCCAAGAAGG CAAAAGTAAA AACTGCAAAG CGTCTTAAAA 180 

TACAAGTTCA GAAGCCACAT TATATCACTT CTGTTGCATT CTATCAAAGC AAGTCACAAG 240 

CCCCTGCCAA TCA 253 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 183 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
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CACACACTCC CCCATTCTGA GCCCCAAGAG GCTCATCCCT AAGGATGTCC AGAGATCCAA 60 

GTGGAGAAGG AGAATGTGGT GAGGCTATTT ATTCCCCCAG TGCCTTCCCT GCTGGGCTAT 120 

GGATGAACAG TGGCTGACTT CATCTAGGAA AGAGCTATGG CTTCTGTCTC CTGGAGCTCA 180 

CCA 183 

(2) INFORMATION FOR SEQ ID NO: 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 387 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
X (D) TOPOLOGY : 1 inear 

(xi) ^SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ggtgagCccc : AGGAGAGAGA AGAGATATGA GGAAATTGTT AAGGAAGTCA GCACTTACAT 60 

TAAGAAAATT GGCTAGAACC CCGACACAGT AGCATTTGTG CCAATTTCTG GTTGGAATGG 120 

TGACAACATG GTGGAGCCAA GTGCTAACAT GCCTTGGTTC AAGGGATGGA AAGTCACCCG ... 180 

TAAGGATGGC AATGCCAGTG GAACCACGCT GCTTGAGGCT CTGGACTGCA TCCTACCACC 240 

AACTCGTCGA ACTGACAAGC CCTTGCGCCT GCCTCTCCAA GGATGTTCTT ACAAAATTGG 300 

TGGTATTGGT ACTGTTCCCT. GTTTGGCCGA ATTGGAAAAC TGGi^TTCCT CCAAACCCCG. 360 

GTTATGGTGG ; GTTTCCTCCT CCTTGGA 38"? 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS : \ 

(A) LENGTH: 366 base pairs 

(B) TYPE: nucleic acid ■ - ^ " : ' 

(C) STRANDEDNESS: single ; ; 

(D) TOPOLOGY: linear : ' v ' " '■' " ■ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GGGCGGAACA AGGGAGCGCT AAAAGGAAAT TAGGATGTCA GG^TOCATAAA GGAACATAAT SO 

TCCAAAACCT TTCCAAACCC CAAATTTATT CAAAGGAACT GAGGAGTGGA TTGAGG AGTG 120 

GACCAACACT GGCGCCAAAC ACAGAAATTA TTGTAftAGOT : TOC^ 180 

GTCTGGGCCC CAAGGAAAAC TGGGTGCAGA GGC?TTGTG<^^ 240 

AGAATTCATA AAAAAATTCA TTCTCTGTGG TATCC&A&AA t^^GlkSAAGK' TGC&VQTGAA 300 
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ACTTCAAGCA AATCTACTTC AACACTTCAT GTATTGTGTG GGTCTGTTGT AGGGTTGCCA 360 

GTTGTT 366 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 598 base pairs 
(8) TYPE: nucleic acid 
(C> STRAND EDNES S : single 
• v (D> TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 5: v 
GCTTGGGCCC CAAGGAAAAC TGGGTGCAGA GGGTTGTGGA GAAGTTTTTG AAGAGGTAAG 60 
TTATATATTT TTGAATTTAA AATTTGTCAT ^TATCCGTGA GACATATAAT CCAAAGTCAG 120 
CCTATAAATT TCTTTCTGTT GCTAAAAATC GTCATTAGGT ATCTGCCTTT TTGGTTAAAA 180 
AAAAAAGGAA TAGCATCAAT AGTG AGTGTG TTGTACTCAT GACCAGAAAG ACCATACATA 240 
GTTTGCCCAG GAAATTCTGG GTTTAAGCTT GTGTCCTATA CTCTTAGTAA AGTTCTTTGT 300 
CACTCCCAGT AGTGTCCTAT GTTAGATGAT AATGTCTTTG ATCTCCCTAT TTATAGTTGA 360 
GAATATAGAG CATGTCTAAC ACATGAATGT CAAAGACTAT ATTGACTTTT CAAGAACCCT 420 
ACTTTCCTTC TTATTAAACA TAGCTCATCT TTATATTGTG AATTTTATTT TAGGGCTGAd 480 
AATTCATAAA AAAATTCATT CTCTGTGGTA TCCAAGAATC AGTGAAGATG CCAGTGAAAC ^ 540 
TTCAAGCAAA TCTACTTCAA CACTTCATGT ATTGTGTGGG TCTGTTGTAG GGTTGCGA 59& 

(2) INFORMATION FOR SEQ ID NO: Si 

(i) SEQUENCE CHARACTERISTICS: \^'t.::'^^^:^-^^ 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid r - : ;^ I \ 

(C) STRANDEDNESS : single • \.V 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CGCCTCAGGC TGGGGCAGCA TT ^" AA ; ..:r *i\ry:C:&:-. V- V .' J " 22 

(2) INFORMATION; FOR SEQ ID NO \:'^v]' ; r. . . ^^.yr : ; ;. v^-v-'-: ->Ay,,; -V t*. : - 

<i) SEQUENCE CHARACTERISTICS s- 

(A) LENGTH: 25 base pairs . v 

(B) TYPEs nucleic acid 
id) STOANDEDNESS : single 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
ACAGTGGAAG AGTCTCATTC GAGAT 25 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic* acid 

(C) STRANDEDNESS : single r 

(D) TOPOLOGY : linear 

- (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8: 
CGAGCTGCCT GACGGCCAGG TCATC 



(2) INFORMATION FOR SEQ ID NO : 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 25 base pairs 

(B) TYPE: nucleic acid . 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GAAGCATTTG CGGTGGACGA TGGAG 7 ; ~ : v 25 



(2) INFORMATION FOR SEQ ID NO : 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single "\ ry .; 

(D) TOPOLOGY: linear .." . ^ 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

TGCAAACTTT CACCTGGACT T "V. * - . v..".- . 21 

(2) INFORMATION FOR SEQ ID NO: H: \^/^S^'-'-V 1 *' 

(i) SEQUENCE CHARACTERISTICS: ; f ^ ? . 

(A) LENGTH: 24 base pairs ' 'hxfps:\, 

(B) TYPE : nucleic acid . -"-...<•.••• 

(C) . STRANDEDNESS : single ^ . ... ' v * 4 - f 

(D) TOPOLOGY: linear ,, , /■ r : .\-\ci: . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
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CTTGTGACTT GCTTTGATAG AATG 



24 



(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION : SEQ ID NO: 12 : 
GACAACATGC TGGAGCCAAG TGC 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY :* linear 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

CD) TOPOLOGY: linear '.' 
(xi) SEQUENCE DESCRIPTION : SEQ ID KQ: 14 : V\ 
TGTCCAGAGA TCCAAGTGCA GAAGG . 25 

(2) INFORMATION FOR SEQ ID NO: 15: - 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



ACCACCAATT TTGTAAGAAC ATCCT 



25 



(2) INFORMATION FOR SEQ ID NO: 14: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 25 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID /-NO* l£ '^ [y-' ' 




GAGCTCCAGG AGACAGAAGC CATAG 



25 
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(2) INFORMATION FOR SEQ ID NO: 16: \ rv T; 

(i) SEQUENCE CHARACTERISTICS: V . 

(A) LENGTH: 17 base pairs 

(B) TYPE : nucleic acid _ : 

(C) STRANDEDNESS : single . v. 

(D) TOPOLOGY: linear ^ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GGGCCCCAAG GAAAACT 17 

(2) INFORMATION FOR SEQ ID NO: 17: ; V' J } 

(i) SEQUENCE CHARACTERISTICS: '^:}-:Li^:: 

(A) LENGTH: 19 base pairs ; * 

(B) TYPE: nucleic; acid i ^.; ^ 

(C) STRANDEDNESS: single J " \ ! 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TGGCAACCCT AGAACAGAC ' .* : - 19 

(2) INFORMATION FOR SEQ ID NO: 18: : • : 

(i) SEQUENCE CHARACTERISTICS: 4 * I . i'- .*'_- 

(A) LENGTH : 17 base pairs * . > \ 

(B) TYPE: nucleic acid :: 

(C) STRANDEDNESS: single ' 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GGGCCCCAAG GAAAACT / " " • 'P'H P*''- V : '- : ->\\ ' 1*7 

(2) INFORMATION FOR SEQ ID NO: 19: -I \ f J':.- 

(i) SEQUENCE CHARACTERISTICS: * I; \. : o^^-^f ^i;^-^; • < : ,: ^ N * 

(A) LENGTH : 20 base pairs ''-r* x \? :• ;' ; : . ' \" 

(B) TYPE: nucleic acid fj . ..■ s , : :-;/ : - t . %: *^.. 

(C) STRANDEDNESS s single ^ £ 

(D) TOPOLOGY: linear 



TGGCAACCCT ACAACAGACC { ^ " ~ "20 



(2) INFORMATION FOR SEQ ID NO: 20: 
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(i) SBQUENCK CHARACTERISTICS : . 

(A) LENGTH : 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

.i ,-- • • V -* 

(xi) SEQUENCE DESCRIPTION r SEQ ID NO; 20: ; 
ACATTGAAGC ACTCCGCGAC 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2i : \ 
AGAGTGGCAG CAACCAAGCT 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



GCCTCAGGCT GGGG CAGCAT T 



21 



(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 




(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 





GGTCACCTTC TGAGGGTGAA CTTGC 



25 



(2) INFORMATION FOR SEQ ID NO: 24: 




(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
ACGACTCACT ATAAGCAGGA 



20 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

/ ~ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
AACAGCTATG ACCATCGTGG , .. .. 



20 



(2) INFORMATION FOR SEQ ID NO: 26 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
. (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) .-SEQU^PE ; DESCklPTION: SEQ ID NO:; 26 J 
ACGACTCACT ATGTGGAGAA 



20 



(2 ) INFORMATION 3Pt)R SB** ID NO : 27: 

( i ) ^SEQUENCE , CHA^CTERIiSTIC^ : 

v (B) ? TYPfi z nucleic acxd K 
(C) STRANDEDNESS : single 



(D) TOPOLOGY : linear 



(xi) SfeQUENCE 
AACAGCTATG ACCCTGAGGA 



S&S ID NO: 27f > 



20 



(2) INFORMATION F&R SEQ ID NO: 28: 



(i) SEQUENCE -CHARACTERISTICS: 
' V ^ A) • ^iEiiId^i: ^ 

(B) TYPE: niicleic acid 
s . (C) STRANDEDNESS : single 
(D) TOPOLOGY ? linear 



.-p% v. 



WO 98/24935 



134 



PCT7US97/22105 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GGAGCTGCCT GACGGCCAGG TCATC 25 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1599 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single . . . 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS '" r 

(B) LOCATION: 115. .744 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

GCGGCAGGCG CGGCAAATTA CGTTGCCGGA GCTGAACGGC GCGGCTGGTC TGAAGGCAAA 60 

CAAGCGAGCG AGCGCGCGAT AGGGGCCGAG AGGACGCGCA GGTGGCGGCG TTGC ATG 117 

Met 

1 - 

TCG CAC GGT CAC AGC CAC GGA ATG GGT GAC TGC CGC TGC GCG GCC GAA 165 
Ser His Gly His Ser His Gly Met Gly Asp Cys Arg Cys Ala Ala Glu 
5 10 15 

CGG GAG GAG CCG CCC GAG CAG CAC GCC ATG5 GCT ACG CTG. TAC CTG CGC 213 
Arg Glu Glu Pro Pro Glu Gin His Ala Met Ala Thr Leu Tyr Leu Arg 
20 25 30 

ATC GAC CTG GAG CGG CTG CAA TGC CTT AAC GAG AGC CGC GAG GGC AGC 2&1 
lie Asp Leu Glu Arg Leu Gin Cys Leu Asn Glu Ser Arg. Gflu Gly Ser 
35 40 ' 45 

GGC CGC GGC GTC TTC AAG CCG TGG GAG GAG [030 i^C^S^ AAG 309 

Gly Arg Gly Val Phe Lys Pro Trp Glu Glu^.Ar^'' ; -ltar ; -As^ ;; Ar^v^Sea^^ Lys 

50 ' 55 : " 60 ■■ \\ . V[ ^ ■ / r "\ '.65 

TTT GTT GAA AGT GAT GCA GAT GAA GAG CTT Cite TTT MT ATT CCA TTT 357 
Phe Val Glu Ser Asp Ala Asp Glu Glu Leu Leu Phe Asn XL& Phe 

70 ' J 75 ' ''" r "'' 80 "' 

ACG GGC AAT GTC AAG CTC AAA GGC ATC ATT ATA ATG GG^ GAG GAT GAT . 405 

Thr Gly Asn Val Lys Leu Lys Gly He lie He Met Gly Glu Asp Asp 

85 90 v-.-r'X.:. \^^-^>^v^^ : 'X'' '"• : 

GAC TCA CAC CCC TCT GAG ATG AGA CTG TAC i\AGvi^T ^"T ? CCA CAG - ATG , 453 
Asp Ser His Pro Ser Glu Met Arg Leu Tyr ' -Lys j / Il<e v Psro v <3ln Met 
100 105 ' ^J ^ ir 1 ^ ' ^: =>^':" v^-> 

TCC TTT GAT GAT ACA GAA AGG GAG CCA 501 
Ser Phe Asp Asp Thr Glu Arg Glu Pro Asp Gin Thr Phe Ser Leu Asn 
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115 120 125 

CGG GAT CTT ACA GGA GAA TTA GAG TAT GCT ACA AAA ATT TCT\ CGT TTT 549 
Arg Asp Leu Thr Gly Glu Leu Glu Tyr Ala Thr Lys lie Ser Arg Phe 
130 135 140. 145 

TCA AAT GTC TAT CAT CTC TCA ATT CAT ATT TCA AAA AAC TTC GGA GCA 597 
Ser Asn Val Tyr His Leu Ser lie His lie Ser Lys Asn Phe <Gly Ala 
15T) 155 ±60 

GAT ACG ACA AAG GTC TTT TAT ATT GGC CtG AGA GGA GAG TGG ACT GAG 645 
Asp Thr Thr L^s Val Phe Tyr lie Gly Leu Arg Gly Glu Trp Thr Glu 

165 170 175 : \ 

CTT CGC CGA CAC GAG GTG ACC ATC TGC AAT TAC GAA GCA TCT GCC AAC 693 
Leu Arg, Arg His Glu Val Thr lie Cys Asn Tyr Glu Ala Ser Ala Ash 
180 ' 185 190 

CCA GCA GAC CAT AGG GTC CAT CAG GTT ACC CCA CAG ACA CAC TTT ATT 741 
Pro Ala Asp His Arg Val His Gin Val Thr Pro Glh Thr His Phe He 
195 200 205 

TCC TAAGGGCTGG CCAAGGCTCC CATAGAGGCG CTGTGTCAGT GAAGATGTAC 794 
Ser 

210 t , / . 

GACTACCTGT TGGGAAGGAC AAAGGGATGA GGCTCCAGAG AGAGTTGGCT GCCACAGCTC 854 

TGCCAAGCTT TGTCTTTGG<i GCTTGCTGCA GAAACCTGGC CTACGGAAGA TACGACACCA 914 

CTGGGAGGGT T^TGTA(^TG ^CCAGG»GGACC , ATCGTGC3TTC TCTAQGGCGC TGTGGAAATT 974 



-■ <■ ... 



GGGTCTTGGG CTGGGTGGCA TCTGGCAGTC ATGGGTAAGA CTTGCTTTTC CAGTTAATGT 1034 

GGCCATGTGA TTCCAAGTGT CATGTTdCTT TGTGGAAGAT TGTTGTGTGA CT^GTTT^PTT 1094 

TGATTTTGTA TTTGTT*TTTT TAAAGGAAAC TATTTGTQGG CTATAGGAAA GTTrTCTGATG 1154 

CCTCCGGATT GTGTTAGTAG TAGCCATCAG GAGGGTCTCC. AACTTAAAACA CTTGTTCCTG 1214 

CTTGCTCCTT TCCCCTCTCiA TT^TOCAdCA TTCTTGTCAA GTTGCCCAqfi TTGGAGTTGT 1274 

CTGTCACGCA CATCrGTCCT GTQGTTATAG CTAGAAGGAC AGGAGTCTCC TGCTGATGCG 13,34 

TGATAGCTTA AGCTTt^SGA GAAGGTCTTT TCCAC^GCCT AGCTAAGCAG TCTCGGGAGA 1394 

GCATGGGGAT 7 & ATCl?GGTGAG TAAGATTGAG AGTTAC&TAA 1454 

GATTCCCCTT GGAAATTCCT TAATGTTTAT TAGCTTCTAA CTAGTGTTGT AAGTCCGATG . 1514 

CCAGAATTTG GAGATTTGAG TT C TT C CTTT CATGGCTTTT ATTCACTGTG ACTAATAAdC 1574 
TTCCTAATAA ATCCTTGCCA GACTT , • . ^ 5 ? 9 
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(2) INFORMATION FOR SEQ ID NO : 3 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH': 2X0 amino acids 

(B) TYPE: amino aCld 

(D) TOPOLOGY : linear 

V 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION; SEQ lb N6: 30: 



Met Ser His Gly 
1 


His 
5 


Ser 


His 


Gly Met Gly Asp Cys Arg Cys Ala Ala 
10 15 


Glu Arg Glu Glu 
20 


Pro 


Pro 


Glu 


Gin 


His Ala Met Ala Thr Leu Tyr Leu 

25 . 30 


Arg lie Asp Leu 
35 


Glu 


Arg 


Leu 


Gin 
40 


Cys Leu Asn Glu Ser Arg Glu Gly 
45 


"Ser Gly Arg Gly 
50 


Val 


Phe 


Lys 
55 


Pro 


Trp Glu Glu Arg Thr Asp Arg Ser 

60 : - 


Lys Phe Val Glu 
65 


Ser 


Asp 
70 


Ala 


Asp 


Glu Glu Leu Leu Phe Asn He Pro 
75 80 


Phe Thr Gly Asn 


Val 
85 


Lys 


Leu 


Lys 


Gly He He lie Met Gly Glu Asp 
SO ' 95 


Asp Asp Ser His 


Pro 


Ser 


Glu 


Met 


Arg Leu Tyr Lys Asn lie Pro Gin 



100 105 110 

Met Sen Ph& &sp Asp Thr Glu Arg Glu Pro Asj> Gin Thr i>he ^ Ser Levi " 

I, \ 120 .. 125 ......... .. 

Asn Arg Asp . Leu Thr Gly Glu Leu Glu Tyr Ala Thr Lys lie Ser Arg 

130 - ■ ■■■ ■v' 135" - 140 'P- 

Phe Ser Asn Val Tyr His Leu Ser lie His He Ser Lys ^ri Phe Gly 

Ala Asp -Kir Thr Lys Val Phe Tyr He Gly : Leu Arg Gly Glu Trp Thr 

' ^ • ^^165" • 170 ^' V '' : 175 

<31u Leu Srg His Glii Val'l^ 

V-. ; ,.v- ; ;.,,: * 8a > 18? . , /.;*?:9.:V-:^ I - : ...y.r V v: V-V;^- 

Asn Pro Ala Asp His Arg Val His Glji Val .Thr , Prp r Glir Thr~ His Phe 

195* • "" 200 *"'"' s ' f " ^ 205 rvs " 

' lie Ser '-'h^H- :K- ^ - - > '"' V-' • ;;r''* ■ ;> ^?^V^^ > " r"-^^^^ 

. , . 2io> v-.-."-.: ;:• . • . . . . .., . • . ,.' V- ;.. . 

(2i INFORMATION FOR SEQ ID NO: 31: . * ' ~ > 



■f 
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(i) SEQUENCE CHARACTERISTICS : 
<A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: , linear., . v -V-tV/v t \ ', - ■■■ h.' : / '-" : * V T' : 

(xi) SEQUENCE DESCRIPTION : SEQ IP NO: 31 : ^ 
CTGGCCTACG GAAGATACGA CAC ; \. t l -. 23 

(2) INFORMATION FOR SEQ ID NO: 32 : j T _ v . : • 

% (l) SEQUENCE CHARACTERISTICS : . ■ 
IM LENGTH: 23 base pairs 
? . (L) TYPE: nucleic acid v \U^'U ^-W^^f^ 
>; - ^ ^ <C> S-TRANDEDNESS : single' ' " \ \ 'V . 

<D) TOPOLOGY: linear •• : . ;^v. . -^v- v.? '^^ :V ' ; - " 

, (xi ) SEQUENCE DESCRIPTION : SEQ It). -NO : ^2 : V 
ACAATCCGGA GGCATCAGAA ACT 23 

(2) INFORMATION FOR SEQ ID... NO,:, .33 : y. ; f * : ' ~ V 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE : - nucleic aLcid 

(C) STRANDEDNESS : single r. . v ; ' 

: :t \. . .^-^r;.. r Uiiy^^^^^^?: '.a^^a^ 7 ""'"- '- : ■ lV " v;;::, - " ' . • 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
AGCCCCGGCC TCCTCGTCCT C 21 

: ( 2 ) r INFQRMAT I ON - WQRi • S^Q : 3I> >NO^ 34 :y. ^V' ? 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single - , 
(B) TOi>OiiG)GY: linear 



J (xi6 • SE^uffifCS DE^^tPTlQN^ SEQ ID NO : 34 : 



^ #^r^& i* 
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CLAIMS •• 



I. 



A method for identifying markers for a disease stale, comprising the following steps* 
a) pr„ vldmg a fctset of p^^, ^ orWsubiecJL 

normal subjects; , rmore 

h) amplifying both sets of mRNAs ,„ p ro vide nucleic acid ampfification product 
c) companng said sets of amplification products; and ' 

10 subi J° d j******" 0 -""' ^'-^.M-Xexpresse*^ 
1« subjects and subjects exhibiting said disease state- 

whetein a difference in « of-expression of an mRNA is fflicauve of a disease mar^ 
2- The method of claim 1, former defined 



as comprising the step of using said mRNAs 
.empiatesforDNA^mesisinareverse^nscrip^erOaction. 



as 



3 "herein^ 

^mscuous obgonucleofide primers, anchoring pHmers or a combiPafion of met are used " 
primers m the reverse transcriptase reaction. 



•> TTie method of claim 1, herein arbiti^i.yehosen oligonucleotides ... 



, promiscuous 



5. 



The method of Cairn 1, wherein the disease state is metastatic or organ commed cancer 
•^lupuservuren.fosis,menma,oid^ ' 
-omnnunemvroimtis.amv^ 

«• Hie me-h*, of claim 5, wherein me disease state is metns-atic prostate cancer. 
» 7. ^tneu^ofclaimS.wlK-r^^ 
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8. The method of claim 1 , wherein said subjects are laboratory animals. 

9. The method of claim 1 , wherein said subjects are humans. 

1 01 ; A method of detecting a metastatic cancer disease state in a subject, comprising the steps 
of: 

a) detecting the quantity of expression of a metastatic cancer disease marker 
expressed in peripheral blood of said subject; and 

b) comparing the quantity of expression of said marker in peripheral blood of said 
subject to thW quantity of said marker expressed in peripheral blood of one or more normal 
subjects; 

wherein a difference in quantity of expression of said marker in peripheral blood of said subject 
relative to quantity of expression of said marker in peripheral blood of said one or more normal 
individuals is indicative of a metastatic cancer disease -state. 

1:1 i The method of claim 10, Wherein said disease marker is an mRNA. 

1 2. The method of claim 11 /wherein said ifiRNA is arhplifieid by an RlsfA polymerase 
reaction. - : •' ' • • 

1 $M llii^meti^ ^el^iiri 1 l;i herein ^tfttiR^ ^tfified by tdvetsc transcriptase 
jk)lyme^e cAai^ 

14. The'ih^thod of claim 10, wheffciri ^aid detectiilg ik by £N A fingerprinting, branched DNA 
or nuclease proteption assay. 

1.5. The method of claim 10, wherein said metastatic cancer disease state is metastatic 
prostate cancer. 



5 



10 



15 



20 
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16. The method of claim 10, wherein said metastatic cancer disease state is metastatic breast 
cancer. 

17. The method of claim 1 1 in which said mRNA comprises one or more of the sequences or 
the complements of the sequences disclosed herein as Genebank Accession numbers D87451 
TO301X X03558. M28130, Y00787, SEQ ID NOM, SEQ ID NO:2, SEQ ID NO:3, SEQ ID ' 
NO:4. SIX.) II ) NO:5 or SEQ ID NO:29. 

18. 1 hcmctlKKlol claim ;- I0 i„ which ^ is a productof me mterleuJdn^gene. 

19. The method of claim 10, wherein said metastatic cancer disease marker is identified by 
the method of claim I. 

20. The method of claim 11, further defined as comprising the steps of 

a) providing primers that selectively amplify at least a portion of said disease state 
marker; 

b) amplifying^ 
amplification products; 

c) detecting said nucleic acid amplification products; and - 

d) measuring the amount of said nucleic acid amplification products formed. 



2 k > methpd « f c,aim 20 in which said primers are selected to produce an amplicon 
having a sequence of or complementary to a sequence of at least a 50 base conUgupns, segment of 
Genebank Accession numbers D87451, T03013, X03558, M28130, Y00787, SEQ ID NO:l, 
25 SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NQ:5, or SEQ ID NOr29. \ v 

22. The method of claim 21, wherein said amplicon is from about 50 to about 500 bases in 
length, 
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23. The method of claim 2 1 , wherein said amplicon is from about 1 00 to about 4 1 5 bases in 




24. The method of claim 10, wherein said metastatic cancer disease marker is a polypeptide: 

5 • ; : r v-- ?i - " . . c r :.- 

r 

>25. The metlibd of claim 24, wherein said polypeptide is encoded by a nucleic acid sequence, 
comprising the sequence disclosed herein as Geriebank Accession numbers D87451, T03013, 
X03558, M28 1 30,* Y()6787, SEQID NO: 1 , SEQ ID NO:2, SEQID NO:3, SEQ IDNO:4, SEQ 

. ID NO:5, or SEQ ID NO:29. 

26/ TOe method of claum 24, wherein said detection comprises antibody immunoreaction with 
said polypeptide. 

27: The method of claim 26, wherein said detection comprises an ELISA, an 
1 5 immunoprecipitation, a radioimmunoassay, an immunohistochemical, Western blotting, dot 
blotting, or FACS analyses. 

28. The method of claim 24, wherein said polypeptide is encoded by the 1L-8 gene. 

20 29. The method of claim 10 or claini 24, wherein said marker is a product of the IL-8 gene 

and wherein said comparison is between two alternatively spliced forms of an IL-8 gene product. 

30: Tbe method tiff cMm 2^ ^^reth the quaht^ of IL-8 polypeptidie in peripheral blood is 
measured using an in vitro bioassay that detects at least one 11^8 mediated bi^logicjil^fdeess. 

mi '/* Method of ca 29 saitf mid^ 

Y00787, SEQ ro NO:4 and SEQ ID NO:5. 

32/ s ^^^^^^indastil^e for prognosis ortJia^tisisof a dis^^ tonditio^n, whereM 1 SSftfdisease 
30 toarkfer is identified by a jproc^ conaprising: ■ U ; > ' ^ 
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a) providing a first set of peripheral blood mRNAs ftom one or more subjects known 
to exhibit said disease state and a second set of peripheral blood mRNAs from one or more 
normal subjects; 

b) amplifying both sets of mRNAs to provide nucleic acid amplification products; 
5 c) comparing said sets of amplification products; and 

d) identifying those mRNAs that are differentially expressed between normal 
subjects and subjects exhibiting said disease state; 

wherein a difference in quantity of expression of an mRNA is indicative of a disease marker. 

10 33. The disease marker of claim 33, wherein the disease state is metastatic or organ confined 
cancer, asthma, lupus erythematosis, rheumatoid arthritis, multiple sclerosis, myasthenia gravis, 
autoimmune thyroiditis, amyotrophic lateral sclerosis, interstitial cystitis or prostatitis. 

34. The method of claim 32, wherein the disease state is metastatic prostate cancer. 

15 r . , - : 

35. The method of claim 32, wherein the disease state is metastatic breast cancer. 

36. The method of claim 32, wherein said subjects are laboratory animals. . 
20 37. The method of claim 32, wherein said subjects are humans. 

38. A method of detecting prostate cancer in a biological sample, comprising: 
:« •] ..-^:'.V- meas;tj^g the levels pf II>8 in qombiiiation with at least one prostate disease 

: { mato in said sarnie; apd 
25 (b) comparing said levels with corresponding levels obtained from reference 

v ^ ^ of normal indi™^ 

39. The method of claim 38 in which said prostate disease marker is selected from a group 
^^ep^sta^ total, prostaite ^ecific antigen (PSA); prostate specific membrane antigen 

30 (PSMA=Folic Acid Hydrolase); prostate acid phosphatase (PAP); prostotic ^i^to^ protein 
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(PSP94); human kallekrein 2 (HK2); and the ratio of the concentrations of free and bound forms 
ofPSA(f7tPSA). 

40. The method of claim 38 in which the biological sample Comprises peripheral human 
""S' : blbodi ' '"■ *. ■ 

4 1 . The method of claim 38 wherein the level of IL-8 in a biological sample is measured 
rising at least onfe antibody that binds to at least oiieIL-8 gene product; ^ 

1 0 42. The method of claim 4 1 wherein the level of IL-8 gene product bound to airitibody is 
measured by ELISA. 

43 * The method of claim 3 8 wherein the level of IL-8 in a biological sample is measured 
using at least one oligonucleotide probe that binds to at least one IL-8 messenger RNA (mRNA). 

15 ' ' 1 • y- . - ■ . ■ ' ' •-" ■~- : ' [ \<- 

44; The method of claiiti 43 wherein the IL-8 mRNA is alternatively Spliced' to include intron 

3. * - . " -< ' • '• '* ' - - 

45 The method 6^ 

20 is measured by nuclease protection assay. ^ - 

46 ThfctoSthodoih^i^^ to H>8 mRNA 
- is measured by RT-PCR™. ■ *P £ 

25 *»7v The niethod of claiiib 43 Wherein the level of oligonucleotide probe^k)tirid to IL-8 mRNA 
is measured .by ligase chain : t^a^S^^^:^^j- : »>U^. : ' : ./;/-..^ ; ^jOxV^v- 

- 48r The<ni^ 

is measured by PCR™. <'i^)^ ; ;x.y^:y- / r K- ^H* : 
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49. The method of claim 40 wherein the level of IL-8 in a biological sample is measured 
using an in vitro bioassay that detects at least one IL-8 mediated biological process. 

50. The method of claim 44 wherein the level of IL-8 in a biological sample is measured 

5 using at least one molecule that binds to an IL-8 gene product, wherein said molecule is selected 
from a group consisting of: an IL-8 binding protein; and an IL-8 receptor protein. 

5 1 . The method of claim 48 whferein the level of prostate disease marker in a biological 
sample is measured using at least one antibody that binds to at least one prostate disease marker 

10 protein. 

52. The method of claim 51 wherein the level of prostate disease marker protein bound to 
antibody is measured by ELISA. 

15 53. The method of claim 39 wherein the level of prostate disease marker in a biological 

sample is measured using at least one oligonucleotide probe that binds to at least one prostate 
disease marker messenger RNA (mRNA). 

.. > 54. The method of claim 43 wherein the level of oligonucleotide probe to]imd to prostate 
20 disease marker mRNA is measured by nuclease protection assay. 

/i\y>>i- ?i55.;.- . ; T^e.method of claim 43 wherein the l£¥el of oligonucleotide probe bound to prostate 
disease marker mRNA is measured by RT-PCR™. V 

2§ ; <; f 56. ; Tht^mejthod of claim 43 wherein the level of oligonucleotide probe bound to prostafe ; c 
disease marker mRNA is measured by ligase chain reaction* 

AVl >;:u57-^ 1 7 'l^<m^od-Of j^imi 43 wherein the le^ pilpligonuele^^ prostate 
disease marker toRNA is measured by PCR™. 



J>1> 
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58. A method of differentially diagnosing prostate cancer and benign prostatic hyperplasia, 
comprising the step of measuring the levels of IL-8 in combination with at least one prostate 
disease marker in a biological sample. 

5 591 The method of claim 58 in which said prostate disease marker is selected from a group 
corisistmgof: total prostate specific antigen (PSA), prostate specific niembrah6 antigen 
(PSMA=Folic Acid Hydrolase), prostate acid phosphatase (PAP), prostatic secretory proteins 
(PSP 94 ), human kallekrcin 2 ( IIK2), and the ratio of the concentrations of free and bound forms 
of PSA (fft PSA). 

60. The method of claim 59 in which said biological sample consiists of peripheral human 
blood. 

61. A kit for use in detecting a human disease, comprising: 

(a) a pair of primers for amplifying a disease state marker consisting of a nucleic 
acid; and 

(b) containers for each 6f said primers. 

62. A Idt according to clrn^ the pair of piimersis Selected tb kn^lify k nucleic 
20 acid marker for metastatic human cancer. 

<W$$l- J - ^ kitksCbrdiMg^) claini62 in \^hich the pair of pn^eir^is selected to amplify a^ueleic 
&cid having a sequence comprising at least a 50 base segment of Genebank Accession numbers 
D87451, T03013, X03S58, M281 30, Y00787, SEQ1D NO: 1, SEqMnO:2> SEQl&NO:3, 
25 SEQIDNO:4,SEQIDNO:5,orSEQIDNO:29. / ; • S 

641 ' A kit according to claim 62, etrniprising*: - r •■■ 'v., ; 1 " A - ^ :• . 

; ^ aitfplify ariuc^ H) 
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(b) a pair of primers selected to amplify a nucleic acid sequence comprising SEQ ID 
NO:5 or Genebank Accession # M28 1 30. 

65. A kit for use in diagnosing metastatic cancer in a biological sample, comprising: 

(a) an antibody which binds with high specificity to a polypeptide having an amino 
acid sequence encoded by a nucleic acid sequence comprising Genebank Accession numbers 
D87451, T03013, X03558, M28130, Y00787, SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, 
SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:29. 

(b) a container for said antibody. 

66. A kit according to claim 65, further defined as comprising: 

(a) an antibody that binds with high specificity to a soluble EL-8 gene product; 

(b) an antibody that binds with high specificity to a membrane bound IL-8 gene 
product; and 

(c) a container for each antibody, 

67. A kit according to claim 65, wherein said metastatic cancer is metastatic prostate cancer. 

68. A kit according to claim 65, wherein said metastatic cancer is metastatic breast cancer, 

69. A kit for detecting or differentially diagnosing human prostate cancer, comprising: 
(a) at least one detection agent for measuring the levels of IL-8 in a biological sample; 

; : at least one detection agent for measuring the levels of at least one prostate disease 
imarkCT in said biojp^cal sample; and ■-. , U jn 

(c) containers for each of said detection agents, . - ;? . -'y--/}/-C\.; 

- . - \ ... 

70. The kit of claim 69 in which said prostate disease marker is, selected from a group 
comisting of: to^d prpstate specific antigen (PSA), prostate specific membrane antigen 
(PSMA=Folic Acid Hydrolase), prostate acifl phosphatase (PAP), prostatic secre^tpiy proteins 
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(PSP^), human kallekrein 2 (HK2), and the ratio of the concentrations of free and bound forms 
ofPSA(f/tPSA). 

7K The kit of claim 70 in which said detection agents are selected from a group consisting 
5 of: polyclonal antibodies; monoclonal antibodies; oligonucleotides; paired oligonucleotides 
designed to bind to opposite strands of a double-stranded DNA molecule; and at least one 
molecule that binds to an IL-8 gene product 

72. The method of claim 16 in which said breast cancer marker is selected from a group 
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Relative Quantitative RT-PCR Showing 
Di»erential:Expression ol IL-8 (=UC32*1 
in peripheral blood ol patients with 
Metastatic Prostate Cancer (M) and 
Normal Individuals (N) at different 
PCP, cycles (cy) - 
25 cy 2S cy 31 cy 

N M £. M N M imp..,. 



ooserved The Upper band (int.+) includes lntron=3 
•n the mature mRNA. Int.- lacks intron =3 



f »^30 Sec. 



> 
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Relative Quantitative RT-PCR showing 
Differential Expression of IL-8 (=UC325) 
in peripheral blood of patients with 
Metastatic Prostate Cancer (1-5) and a 
Poo! of Normal Individuals (N) 



intron 3+ 



N 



3 4 5 no temp. 



intron 3- 



Two alternatively spliced forms of the IL-8 
mRNA are observed (1-5) are different 
Individuals with metstatic prostate cancer 



: w ii^^s^c-:^: nits «r\to.Ttr 
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Figure 2 

Ability of Total PSA (ng/m 1) to Distinguish BPH and 
Stages A , B , & C Prostate Cancer (n = 142) 
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Figure 3 

Ability vfC^rectih Free/^dtal IPSA Ratio to Distinguish 
BPH and.S^ (n = 142) 
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bility of U C 3 25 (p g/m 1) to D isiin guish BPH an d 
Stages A, B, & C Prostate Cancer (n = 14 2) 

A rea U nder the C»r V e: 0.7973 
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Figure 5 



A bility of U C325 (pg/m 1) & T-PSA (n g/m I) to D istingu ish 
BPH and Stages A, B, & C Prostate Cancer (n = 142) 
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Figure 6 

Ability of U C 3 2 5 (pg/m I) & fit PSA Ratio to D istinguish 
B P Hand S tages A , B , & C P rostate C a n cer (n = 1 42) 

Area Under the Carve; 0.8784 
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